docs and warnings fixed

2025-07-03 18:53:35 +00:00 · 2022-09-10 14:05:45 +01:00
parent 7cb3adf147
commit 6bb5732c70
11 changed files with 160 additions and 126 deletions
--- a/docs/manual.md
+++ b/docs/manual.md
@ -1,17 +1,31 @@
 # Samplebrain Manual
-        
+
+A custom sample mashing app designed by Aphex Twin.
+
+Samplebrain chops samples up into a 'brain' of interconnected small
+sections called blocks which are connected into a network by
+similarity. It processes a target sample, chopping it up into blocks
+in the same way, and tries to match each block with one in it's brain
+to play in realtime.
+
+This allows you to interpret a sound with a different one. Over time
+developing it, we gradually added more and more tweakable parameters
+until it became slightly out of control.    
+            
 ## Brain tweaks:

+![](pics/braintweaks.png)
+    
 These settings control how the block search works.
    
 ### fft / mfcc

-Choose whether to search using FFT (raw frequency analysis) or MFCC
-(Mel-frequency cepstral coefficients) which are higher order paramters
-that attempt to model perception of sound. MFCC is usually a bit
-better, but it depends on what you are doing, you can blend between
-them to use a mix. Setting this to 0% or 100% switches off the other
-search option, so is a bit more CPU friendly.
+Choose whether to match blocks using FFT (raw frequency analysis) or
+MFCC (Mel-frequency cepstral coefficients, parameters that attempt to
+model perception of sound). Which is best depends on the sounds you
+are using, so you can blend between them to use a mix. Setting this to
+0% or 100% switches off the other search option, so is a bit more CPU
+friendly.
   
 ### freq & dynamics / freq only

@ -21,76 +35,85 @@ want the first option.
   
 ### fft subsection

-When using FFT mode you can select a subrange of the (100) frequency
-bins to use for scoring potential blocks, potentially allowing you to
-target a specific frequency range. Not terribly useful in practice.
+You can select a subrange of the (100) frequency bins we use for
+scoring potential blocks, potentially allowing you to target a
+specific frequency range. Not that useful in practice so far.
     
 ### novelty

-One thing that tends to happen is that the same block or set of blocks
-can be overused if there isn't enough variation in the brain
-blocks. Sometimes we want to bias the selection against reuse, so
-novelty biases the selection away from similarity - if you turn it all
-the way up it will ignore the target completely and just play the
-least used ones in some odd semi-random order.
+Often the same block or set of blocks tend to be overused if there
+isn't enough variation in the brain. You can use 'novelty' to bias the
+selection away from similarity, and prioritise similar blocks we
+haven't used yet.
    
 ### boredom

 This increases the speed at which novelty wears off, creating a wider
-spread of possible blocks. Not quite clear exactly why this is
-different to increasing novelty, but it sounds different.
+spread of possible blocks to be used.
+
+If you turn novelty and boredom all the way up it will ignore the
+target completely and just play brain samples in some odd semi-random
+order.
    
 ### stickyness

-If the error is under this threshold, play the next block in the brain
-rather than the closest. This will have the effect of elongating
-chunks of brain samples that you hear.
+If the error of the next block in the sequence after the one we have
+just used is under the stickyness threshold, we will use that rather
+than the closest in our search. This will have the effect of
+elongating chunks of brain samples that you hear.
    
 ### search stretch

 Repeats blocks in the target a fixed amount, like a simple timestretch
- in synaptic mode this gives the system repeated attempts to find a
-closer match.
+- using the above controls it won't necessarily repeat the same brain
+block, and in synaptic mode (see below) this gives the system repeated
+attempts to find a closer match in the network.
        
 ### algorithm

 * basic

-Searches all samples in the brain, and uses the closest match.
+Searches all sample blocks. Not usable with large brains.
    
 * reversed

-Searches all samples in the brain, and uses the least closest
-match. In practice this needs work, as it tends to select silent or
-very quiet blocks.
+Searches all samples in the brain, but selects the least closest match
+instead of the closest. In practice this needs work, as silent or very
+quiet blocks tend to be the least similar.

 * synaptic

-As brains get larger, we get more blocks, and they get slower to
-search. This mode provides a constant search time over arbitrarily
-huge brains. When generating the brains we connect them together into
-a network via similarity (via connections called synapses). We keep a
-position in the network and only search the nearby blocks - this
-assumes that sounds tend to change gradually, or at least more
-gradually than the small block lengths.
-    
+As brains get larger with more blocks, they get slower to search. This
+mode provides a constant search time over arbitrarily large brains. To
+do this we connect the blocks together into a network via similarity
+(via connections called synapses). We store a current "playback
+position" in the network and only search the nearby blocks the playing
+block is connected to. This assumes that sounds tend to change
+gradually, or at least more gradually than the smaller block lengths.
+
+This can leads to the output changing with each repetition of the
+target, as we wander around the synapse network.
+        
 * slide

-Similar to synaptic but if we can't find a close enough match (based
-on synaptic slide error) we stretch the target, repeating blocks until
-we land on a block that is close enough. This mode warps the timing of
-the target.
+Similar to synaptic mode above but if we can't find a close enough
+match (based on synaptic slide error) we repeat the target block
+again, stretching it until we land on a block that is close
+enough. This mode unpredictably warps the timing of the target sound.

 ### num synapses

-How many connections to check in synaptic or slide mode.
+How many connections to check in synaptic or slide mode, up to a
+maximum of 1000 - they are ordered by closeness.
    
 ### synaptic slide error

-The acceptable error to consider a block as "close enought" in slide mode.
+The acceptable error to consider a block as "close enough" in slide mode.

 ## Target sound:

+![](pics/targetsound.png)
+
 These settings control how the target sound is broken up into blocks.
    
 ### load target
@ -100,15 +123,16 @@ Load a target sound to try and match
 ### block size

 The size of the blocks in samples. This does not need to match the
-brain block size, but it probably should.
+brain block size.

 ### block overlap

-Percentage overlap in blocks.
+Proportion to overlap the block generation.
     
 ### window shape

-The shape of the window - "dodgy" is actually box.
+The windowing function for the target blocks, the volume shape given
+to them before use - "dodgy" is actually rectangle, so no shaping.

 ### (re)generate blocks

@ -116,12 +140,14 @@ Compute the target blocks.
    
 ### use mic input

-Attempts to stream blocks live from the microphone. I think this is
-broken at present.
+Attempts to stream blocks live from the microphone. This has not been
+tested fully.

 ## Mix:

-These are settings that happen after the search.
+![](pics/mix.png)
+    
+These are settings that control things happening after the search.
    
 ### autotune

@ -129,21 +155,24 @@ Attempt to pitch bend the chosen brain block to better match the target.
 
 ### normalised

-Mix in normalised brain blocks - removing all dynamics. Might work
-with frequency only search.
+Mix in normalised brain blocks - removing all dynamics. Designed to
+work with frequency only search.
        
 ### brain / target

-Mix in the target blocks to the output - for cheating, or testing purposes.
+Mix in the target blocks to the output - for cheating, or testing
+purposes.
    
 ### stereo mode

-Run everything once for left and again for right speaker.
+Run separate searches for left and again for right speaker.
    
 ## Brain contents

-These settings allow you to build a brain of samples, and switch in
-and out specific samples during playback.
+![](pics/braincontents.png)
+
+These settings allow you to build your sample brain, and switch in and
+out specific samples live during playback.
        
 ### all/none

@ -158,15 +187,16 @@ in one go.
 ### block size

 The size of the blocks in samples. This does not need to match the
-target block size, but it probably should.
+target block size.

 ### block overlap

-Percentage overlap in blocks.
+Proportion to overlap the block generation.
     
 ### window shape

-The shape of the window - "dodgy" is actually box.
+The windowing function for the brain blocks, the volume shape given
+to them before use - "dodgy" is actually rectangle, so no shaping.

 ### (re)generate blocks

@ -178,15 +208,17 @@ You can save and load brains separately to the targets.
    
 ## Lower bar

-General playback settings
-    
+![](pics/bar.png)
+
+General playback settings.
+
 ### play/pause/record/stop

-Start/stop and record
-    
+Start/stop and record.
+ 
 ### volume

-Global volume
+Global volume.

 ### load/save session

@ -196,4 +228,6 @@ Load and save the entire session.

 This allows you to control multiple instances of samplebrain over the
 network all running their own brains simultaneously. This feature has
-not been tested well!
+not been tested well.
+
+
--- a/docs/pics/bar.png
+++ b/docs/pics/bar.png
--- a/docs/pics/braincontents.png
+++ b/docs/pics/braincontents.png
--- a/docs/pics/braintweaks.png
+++ b/docs/pics/braintweaks.png
--- a/docs/pics/mix.png
+++ b/docs/pics/mix.png
--- a/docs/pics/screenshot.jpg
+++ b/docs/pics/screenshot.jpg
--- a/docs/pics/targetsound.png
+++ b/docs/pics/targetsound.png