The Virtual Rat

(formerly: "Extending the Ratlab Model")

This is a log of experiments conducted using the Catacomb simulation environment to test models of brain function in behaving animals, most commonly simulated rats.

Models:
  1. A Medial Temporal Model of Navigation in a T-maze
  2. Further Adaptations

  3. PFC Minicolumns: Path Selection and Planning

  4. Temporal Context

  5. Dendritic Modulation Mode Control

  6. Bartlett W. Mel Pyramidal Neuron Model

Presentations: (more)

  1. Exploring the Virtual Rat (PDF)
  2. STM in the Virtual Rat (PDF)

(Planned:)

  1. Episodic Memory Retrieval: A Forward-Only Method
  2. Context
  3. PFC Rule Encoding
Experiments:
  1. Figure Eight Learning
  2. Navigation in a T-maze: Images and Figures
  3. The Thorpe Task
  4. The W.Schultz Task

Collaborative:

  1. Nathe Collaboration Models
  2. Eichenbaum Collaboration Models
  3. Kantak Collaboration Models
  4. McGaughy Collaboration Models

General Investigations of Neuronal Circuitry:

  1. Neuronal Circuitry: Calculating Dynamics
  2. Neuronal Circuitry: Signal Synchronization
  3. Neuronal Circuitry: Oscillatory STM

(Planned:)

  1. Perturbation Tests
  2. Reversal Learning
  3. Neuronal Circuitry: Robustness and Reliability
  4. Updating Multiple Related Models

A list of all Catacomb models in the virtual rat project: ~/src/nnmodels/ccmb/*.ccm*


1) Figure Eight Learning

Problem Analysis for Figure Eight

Legend:
  • Significant functional observations that can lead to implementation rules for behavioural controls are indicated in bold.
  • Modifications to the model are indicated in italic.

Figure eight causes diagonals to be learned as well... this may be avoided by setting the plastmodthreshold (LTP threshold) on ECIII recurrent to 70% and by modifying their transmod function to have a phase of -40. These changes should help to disambiguate the encoding and retrieval phases of the network.

Doing this helped somewhat, but some undesired weights are still established. This happens once a complete circuit of the figure eight is completed. As activity returns to the orginal location, the cell at the top of the stem is reactivated and becomes associated with a cell at the bottom of the right-hand loop. Subsequently, as the rat begins the reverse traversal, other cells are associated in that manner.

[local-connectivity]This could be avoided entirely by having only local connectivity.

The first instance of trouble is around t=4160. Spiking of the cell at the bottom of the stem seems to recall the cells along the stem, which are subsequently associated with recently active cells. This all happens during the encoding phase. It is not related to the goal signal, which only appears in the retrieval phase.

The retrieval of the other cells in the stem should not be happening during the encoding phase (low theta), despite the level of ADP (which ECIII here does not even have) or other factors.

[transfer-modulation]Another obvious way to avoid retrieval during the encoding phase is to utilize GABA_B receptor modulation of recurrent synapses.

Look at the effect the transmod function has in ECIII. Check to see where activity occurs in comparison to that function. See how the function should be adjusted in order to properly suppress recurrent retrieval during encoding. It may yet be possible to add direct modulation of membrane potentials.

The falsely recalled ECIII activity in the stem occurs in the retrieval phase of the transmod function with phase offset -40. That is as desired, in that the transmod function should impose the mode on the recurrent synapses. Unfortunately, the peak of the retrieveoscneg40 function is very near the peak of the encodeosc14 function that is to link the activity of consecutive locations by modulating the afferent activity from ECIIcanm to ECIII. The previous function, RetrieveOscPos166, seems to have a much more orthogonal phase to that of the encoding function. So why was it replaced with -40? Setting the original function caused even earlier undesired connections to appear, since the phase was such that retrieval occurred at the top of the right-hand circle while encoding was going on at the bottom of that same circle.

Setting plastmodthreshold to 80 percent removed most of the unwanted connectivity.

[dendritic-modulation]Using a real dual-compartment model, or an approximation created by linking one cell body integrator compartment to another with a tube, dendritic function can be separated from cell body function. That allows an entire dendritic section to be modulated out of phase with the cell body, which can act as an all-or-nothing mode control. Recurrent synapses connected to the dendritic compartment can be effectively silenced when the dendritic compartment is modulated GABAergically.

From investigations of the cell activity in PFC, ECIII, CA3 and CA1, it is clear that multiple paths are simultaneously suggested, which causes the undecided behaviour of the rat. Even a random delay between transmissions from one cell to another could possibly avoid this. It is worth trying from a different initial position.

Moving from -40 to -50 for the retrieval oscillation helped somewhat, since starting from the middle of the stem (asymmetric with regard to the goal) again caused a few diagonals to appear. Diagonals were completely abolished by setting the maximum conductance that ECIII weights could achieved through LTP to 16.0 rather than 20.0. Unfortunately, that also led to a lack of retrieval, so that the rat did not move from its position when "run net" was attempted after training.

[disambiguation-conductances]Note for disambiguated encoding-retrieval implementation: Implement recurrents with a maximum conductance low enough to be ineffective during low theta modulation in encoding phases, but able to complete patterns during high theta modulation in retrieval phases.

This may be achieved in approximation right now by attaching an interneuron to ECIII that is driven by EncodeOscNeg14 or by attaching a hyperpolarizing theta oscillation at the same phase. The afferents from ECIIcanm must be sufficiently strong to cause spiking despite that theta modulation of the ECIII membrane potentials.

In the model figure-eight-theta.ccm, I have now added the stimulation node ECIIItheta, with the oscillating signal ThetaHypEC, which is modeled after the signal ThetaDepolEC, and about 180 degrees out of phase with the EncodeOscNeg14 transmod function. It will therefore provide inhibition during the high phase of encoding and excitation during the low phase of encoding. The conductance from the stimulus node, the conductance from ECIIcanm, and the maximum conductance that can be achieved by recurrents in ECIII must now be designed to work together. One way to do this is to first lower the recurrent maximum conductances quite a bit and concentrate only on making encoding work with the new ECIIItheta stimulation included. Then retrieval can be added.

To do this, I have now created the ECIIILTPparameters, in which the maximum conductance has temporarily been reduced to a tenth of its former value, namely 2.07.
I then reduced the conductance from the stimulation node to ECIII from 4.0 to 3.0 and increased the afferent conductance from ECIIcanm to ECIII from 20.7 to 30.0.
Reducing the plastmod threshold now should help by allowing learning to proceed more easily while retrieval is disabled. I have now set the plastmod threshold on ECIII recurrents to 50 percent.

ECIII recurrents  : E=0, Gmax=10.07, delay=2.12, Trise=1, Tfall=2
ECIIcanm afferents: E=0, G=40, delay=1, Trise=1, Tfall=2
ECIIItheta        : Emax=-30, Emin=-92, G=3, frequency=8
The rat seems to learn the track well now. I may wish to set the transmod function back to RetrieveOscPos166 if the addition of theta modulation in the membrane potential suffices to disambiguate encoding and retrieval. Logically, the transmod and plastmod functions should be 180 degrees out of phase, so if the transmod function is set back to RetrieveOscPos166, then the plastmod function should be EncodeOscNeg14. Interestingly, the name of the transmod function implies that phase, but does not actually implement it. Instead, it appears to implement a phase near 0. That may simply be a later (debugging) setting by Mike. It is perhaps a good idea to set that back to -14 as well. [plasticity-modulation]Note though that the plastmod function should not be necessary if the model is working robustly, unless there is actual physiological cause to believe that plasticity is modulated in such a manner.
I have now set the transmod function on ECIII recurrents back to RetrieveOscPos166.
[theta-power]Whatever the transmod and plastmod functions are set to, the theta phase provided by ECIIItheta must be set accordingly. Its conductance can be increased if disambiguation is still a bit fragile.
For the theta modulation to do its part it is important that afferent input from ECIIcanm arrives around the peak of the encoding phase, not during the retrieval phase. Afferent input currently arrives near, but not quite at the peak. If there is no way to move the arrival of the afferent input, then perhaps both the encoding and retrieval phases (transmod, plastmod and ECIIItheta functions) at ECIII should be shifted to match it. First, I must check some other ECIII spiking due to ECIIcanm afferent input. One special case where a spike does occur in the retrieval phase, even during training, is when the rat reaches the goal. Continuous spiking is turned on in that location in PFC and always appears during the high portion of the retrieval phase.
[dual-sequences]If the LTP function is narrow enough so that encoding spikes are not associated with retrieval spikes, the appearance of retrieval elicited by PFC (or any other retrieval, such as ongoing STM) should not interfere with encoding. Note that this may not be the same issue for sequence learning with a model that retains STM, if the PFC context is linked in differently (not through backwards propagation in ECIII). With this method of disambiguation, it might even be possible to have STM retrieval (with strengthening of episodes during retrieval) and encoding of another sequence going on at the same time!

Spiking throughout ECIII appears to fall within the high portion of the encoding signal (although some minor shifting may be possible to center activity at the peak more precisely). The only exception is the additional spiking of the goal location in ECIII, driven by periodic PFC spiking.

The encoding phase appears to be working properly with the new ECIIItheta modulation. Now the retrieval phase must be brought back online.

I'm setting ECIII recurrent Gmax to 20 as a test.
That was too strong, as all diagonals were again added. Either the recurrents must be sufficient at a lower Gmax value, or the conductance from ECIIItheta must be increased. ECIII output is of the proper strength, but the spikes must occur as desired. During encoding, only encoding spikes should occur. During retrieval, ECIII must be able to propagate retrieval through its recurrents. I must test retrieval with "run net" to determine the necessary modification ((a) ECIII recurrent Gmax around 10 or (b) stronger ECIIItheta).

I have set Gmax back to 10 and follow training with "run net".

Inspection of the "run net" result, in which the rat remained stationary, indicates that ECIII spikes at the rat's location occur during the trough of the retrieval signal. That means that the transmission of the spikes is inhibited by the transmod function. ECIII is supposed to implement (controversial) backward propagation of spikes from the goal location, which is driven by PFC. The goal activity still occurs properly near the peak of the retrieval signal, where the transmod function allows maximal transmission through recurrents and the ECIIItheta signal produces minimal inhibition. The recurrent conductances may need to have a greater Gmax. A possible way to do this, while retaining clean encoding is:

The resting potential of the "fastTau" cells used in ECIII is -70V. The ThetaHypEC signal drove their membrane potential between about -60V and -75V.

I am testing a greater ECIIItheta amplitude by increasing the conductanve of the signal to G=5.

Some diagonal encoding reappeared. Retrieval and encoding were again insufficiently distinct.

I'm temporarily setting G of ECIIItheta back to 3 for clean encoding.

Tuning Distinct Mode Controls

Means to achieve distinct training and navigation stages:

  1. Discouraging or preventing association (strengthening of connections) between spikes in respective retrieval and encoding phases of theta.
  2. Driving of the ECIII goal location by PFC in retrieval phases may be turned off during the training stage and turned on during the navigation stage of an experiment.
  3. While spikes must be used to propagate information in ECIII, the strength of recurrent conductances alone may be subthreshold, relying on a combination of (i) recurrent contributions, (ii) a positive retrieval theta phase and (iii) a "seek-next-location" signal to reach threshold.
  4. The conductance of the ECIIItheta signal could be modulated to be greater during navigation (e.g. G=5) and smaller during training (e.g. G=3) stages of the experiment. Again, a motivational drive may be utilized to achieve this modulation.
Desired behaviour in retrieval theta phase: Desired behaviour in encoding theta phase: The distinguishing method (a) above has the added advantage that it also helps to avoid the acquisition of diagonal connections during retrieval phases in the navigation stage of the experiment. Means to achieve desired retrieval and encoding theta phases through controls on method (a) above:
  1. Encoding spikes must occur in a narrow band near the peak of the encoding phase. Retrieval spikes must occur in a narrow band near the peak of the retrieval phase.
  2. Retrieval and encoding phase peaks should be maximally distinct, with a phase difference of approximately 180 degrees.
  3. Either (i) The LTP function must be narrow, or (ii) plasticity must be strongly modulated to avoid connection strength updates in retrieval phases.
The means (1) and (2) above may be modified in the case of an asymmetric theta function. In that case, the phase that needs more active time should take place in the longer of the two theta phases.

The issue marked (*) was problematic upon inspection after training with ECIIItheta conductance G>3. As ECIII cells were more distant from the goal location, their turn in the chain of retrieval appeared later. Retrieval spikes for some cells therefore occurred during the encoding phase. Once that occurs, attempts to distinguish training and navigation phases by methods such as avoiding associations across phases in (a) are powerless. Operations (e.g. learning) that take place in particular phases can only be designated to act on spikes in that specific phase, if the phases and their spikes do not overlap.

Propagation using subthreshold signals

Subthreshold signaling on ECIII recurrents, that implies a dependence on high theta for postsynaptic spiking (**) can achieve (2) and implements (1).
Subthreshold transmission + theta modulation of membrane potentials = robust distinct encoding and retrieval phases for all path lengths.

Some further effort is required when the number of inputs that converge on a cell is not known in advance, such as when multi-cell patterns are used to represent a location or knowledge item. In that case, the maximum recurrent conductance achieved by LTP may need to be adaptable, set perhaps to the connectivity (i.e. sparseness) of patterns.

With the current back-propagation approach, the additional concern is also raised to deal with equidistant cells that receive converging input from multiple paths. A solution for this is presented for the most reliable of the signaling methods discussed below.

Here follows the result that is to be achieved and a number of propagated signal codes that can be used.

The desired result:

Types of signals:

  1. Rapid propagation through ECIII with synaptic delays only, each presynaptic spike causes a postsynaptic spike. Path selection is based on the "analog" time taken for a signal to reach locations adjacent to the current location. This breaks when the path is long, and is mentioned only for comparison, since it is not a code based on subthreshold transmissions.
  2. Semi-stepped propagation based on subthreshold transmission. Propagation is rapid with synaptic delays only during high-theta periods (the retrieval phase). During low-theta periods transmissions cannot elicit spikes, but their response is sufficiently durable to persist into the next high-theta period. Path selection is still based on the "analog" time taken to receive a signal at locations adjacent to the current location, but the code may break down if the goal spikes in each theta period. After the goal location has been spiking for several theta periods, signals from different paths that originated at different goal spike times may arrive in an order that obscures the shortest path. This code is relatively fast.
  3. Stepped propagation with subthreshold transmission. A strong AHP or inhibitory feedback suppresses all but one spike per retrieval phase. (Suppression should last for half a theta period so that suppression caused by afferent input does not deactivate the propagation of retrieval). Path selection is done according to "digitized" timing, in terms of the number of theta cycles taken to activate adjacent locations. If the PFC period is set to activate the goal location only once every N theta cycles, then the obscuring problem above can be avoided.
  4. A code using smaller subthreshold transmissions, which are plausible, can be used to determine the path distance. In semi-stepped fashion an additional requirement is added that a cell must receive high-theta as well as at least M recurrent spikes to fire. Place cells closer to the goal will thereby activaty more frequently in ECIII than more distant place cells. Place cells adjacent to the current location that lead into the shortest path will therefore exert the greatest pull. If multiple tugs are needed to move to another place field the shortest path wins the tug-of-war. This code is fast and relatively reliable.
  5. The smaller subthreshold code can also be employed in stepped fashion. Since stepped mode allows only one spike to occur per discretization unit (e.g. gamma or theta periods) in a cell's retrieval phase, propagation requires summation or integration over those units. This code is very reliable, even if noise causes minor shifts in spike timing. While the code can work with only synaptic delays, discretization is more robust and can utilize empirically observed gamma cycles.
(Note that the concept of "shortest path selection" is not actually this simple, since different place field sizes would distort physical distance.)

An example using code E:

[Goal Distance Dependent Firing]
(goal-distance-dependent-firing.fig)

Cell 6 in the example above exerts a greater pull by spiking more rapidly than cell 4, despite the equidistant location of cell 4 relative to the goal. As each ECIII place cell must receive three recurrent spikes to activate in a retrieval phase, the decrease in activity is exponential with distance. The exponential nature of the decrease outweighs any summation of activity from converging inputs.

The number of spikes needed for activation can be tuned to adjust the robustness of the code in the presence of noise that can change spike timing along various paths. It is not desirable that many spikes are needed, as that slows down propagation. An alternative that maintains strong reliability is to filter inputs (although that would require a modification of the architecture). Each ECIII place cell would receive recurrent input only from a corresponding filter cell. The filter cell would receive recurrent inputs from other ECIII cells, responding with only a single spike per discretization time unit (due to a strong AHP). Thus any number of spikes received in the same discretization time unit are interpreted as a single spike.

It is also possible to suppress activity in the current location in ECIII, which would avoid propagation through that location to equidistant place cells in a loop. Notice that such suppression is not a general solution, because other equidistant locations can exist in loops that don't intersect the current location.

Once the goal information has propagated to the current location, all ensuing steps can happen quite rapidly, even with code E. The most rapid implementation is computed as follows:

An interesting prediction that would support such a code is that certainty and therefore the rate of decision making during navigation will be greater near the goal and lesser the more distant the goal is. I am sceptical that such a prediction would hold in detail, especially with the requirement that place fields act as a measure of distance and that the decrease in the decision rate (which hopefully correlates well with the velocity of motion) would have to be exponential with distance. If verified, only codes D or E described above could model the behaviour. Otherwise, codes B or C may be more plausible. I must check to see if adjustment of the fast AHP will work as desired with the new subthreshold transmissions.

2) Reimplementation in Catacomb 2

Steps and modifications

Reimplemented with Catacomb2, a feature discretizer produces sampled output for the virtual rat location at a specified rate. This can replace the bursting output (40-50 Hz) produced by the place input cells in the previous model implementation. In order to control the duration of the virtual rat's presence in a place field (as defined by its corresponding feature) one can either modify the speed parameter in each trajectory point (or capture a new trajectory point type with the desired speed parameter) or place a scalebar that adjusts the size of the T maze and also adjust the range for the feature discretizer.

The new ECIIinput population has been designed to have AMPA synapses with a fall time of $\tau_{fall}=4$ ms and conductance $G=5$, so that the effect on membrane potential due to only the place input from the feature discretizer is an elevation to $V=-58$ mV for cells corresponding to place fields that the virtual rat is in. This predisposes those cells to fire when receiving spikes from ECIII. The transmission modulation function alone does not do very much to avoid spiking in the ECII buffer in a particular phase, since responses are slow and prior contributions can suffice combined with even a small transmission to achieve spiking. The addition of the membrane potential modulation should complement this to achieve the desired effect.

New Direction:

Instead of reimplementing the ECII buffer as it was, a new implementation is made, using the 3-4 item STM buffer model based on dual oscillations. The aim is to reimplement, while correcting older limitations of the model. The improvements are:

  1. Place field size independence: This will allow the model to work with place fields of varying size, such as are produced with the new feature discretizer component of Catacomb 2. It will also allow us to experiment with smaller place field sizes that can more correctly demonstrate phase precession and support exploration and navigation along diagonal paths.
  2. Unidirectional learning: This will help to store actual episodes in their proper order and avoid problems previously encountered in mazes with circuits. Retrieval should then be limited to actual memories of exploration, excluding paths that would result from the reversal of stored experiences.

Theta modulation:

In order to achieve a sinusoidal theta modulation of the membrane potential between -65 mV and -55 mV, it was necessary to set the conductance of the clamp to 980, given a generated signal between -70 mV and -50 mV at 6.3 Hz. These values may be somewhat different for a signal with a frequency of 8 Hz.

Instead of using a sinusoidal function to approximate theta oscillations, theta stimulation can be modeled as a combination of depolarization caused by reduced K+ leak current and periodic responses to strong inhibition caused by a network of interneurons with GABA_B synaptic channel connections to the pyramidal cell body.

The depolarization due to reduced K+ leak current is implemented by changing the pyramidal cell's resting potential to $-53$ mV. (The reset potential may also be moved from -60 to -53 mV.)

The amplitude of the inhibitory response is assumed to be $-10.5$ mV. The rise and fall times of the synaptic response function are set to emulate a rapid hyperpolarization with a return to resting potential according to $((t-t_{fire})/\tau_{GABA_B})\exp(1-(t-t_{fire})/\tau_{GABA_B})$ , where $\tau_{GABA_B}=30$. This was achieved for a pyramidal cell with $T_{leak}=50$ ms, using the synaptic parameters $G=900$, $E=-90$ mV, $T_{rise}=0.001$ ms, $T_{fall}=30$ ms. (Note that the conductance and potential differences are very similar to those of the clamped sinusoidal signal, G=980 and voltage between -70 mV and -50 mV.)

An alternative set of parameters that produces a theta modulation based on $GABA_B$ channel responses with the same amplitude, but a steeper positive flank is: $G=1150$, $T_{fall}=20$ ms, $V_{rest}=-53.5$ mV.

Modifications of the STM network:

The current modifications that needed to be applied seem to be pushing the STM network into a realm with much greater conductances. Theta modulation through GABA_B dynamics required conductances of 900, and the induction of spiking by afferent input during a low theta mode required a conductance of 1250 with $E=20$ mV, $T_[rise]=2$ ms and $T_{fall}=5$ ms. To achieve repetition of the spike, the ADP conductance was raised from the original 3 to 400! It is not yet entirely clear why this escalation is taking place. The cell's capacitance may be involved, or it may be the result of a general change in the equivalent electrical network caused by the different delivery of theta modulation. With these values, proper behaviour is not yet achieved, since consecutive items in STM merge.

Issues/Bugs awaiting resolution:

Changes since the above (20020110): (1) Laplace transformations are now being used to determine analytically how operational criteria can be specified. (Figure: neuronal-balance-calculations.fig) (2) The conductances have been standardized to nS. This breaks all previous models, and requires significant adjustments to the STM model and other parts of the new implementation. (3) Since changes were needed anyway, the STM model is being adjusted to work with the local theta frequency (8 Hz) and the pyramidal resting potential chosen here (-70 mV rather than -60 mV). A good system for the implementation of the STM model should be independent of the choice of such parameters in any case.

ADP and theta slopes: As long as the ADP responses of consecutive memory items are similar and therefore parallel, separation is best maintained when the response to theta is as flat as possible near the positive peak of theta (with an assumption of additive properties for theta and ADP). Hence, a theta modulation generated synaptically is most beneficial if it has fairly rapid fall timing characteristics, yet sufficient conductance to enforce theta cycles strongly. An example of appropriate parameters may be synaptic reversal potential $E=-90$ mV, characteristic fall timing $\tau_{fall}=2$ ms and conductance $G=200$ nS.

Criteria for a STM buffer recipe: Once the Laplace results for stability criteria are known, a recipe for this STM model design can include specification of the theta synapse. In order to specify a proper theta modulation, the parameters that must be taken into account are: resting potential in the target population pyramidal cells, $V_{rest}$ (e.g. -70 mV), the threshold potential, $V_{thres}$ (e.g. -50 mV), the desired theta amplitude (e.g. 10 mV for modulation between -80 mV and -60 mV), the theta frequency, $f_{theta}$ (e.g. 8 Hz), the leak conductance in the target cells, $G_{leak}$. There may be more. Parameters, such as the adjusted resting potential in target cells, the theta synapse conductance and characteristic timing can then be calculated accordingly. Dependencies propagate from there, e.g. interneuronal spike timing may be adjusted to achieve a number of gamma periods in the positive portion of a theta period.

Theta implementation: With rise timing $\tau_{rise}=0.01$ ms, a combination of fall timing $\tau_{fall}=2$ ms and conductance $G=240$ nS achieves a slope in which theta rises 10 mV in 85 ms in the positive portion of theta. With a combination of fall timing $\tau_{fall}=1$ ms and conductance $G=480$ nS that slope is 10 mV in 88 ms. The two slopes are very similar, so both combinations are usable.

Afferent input implementation: With the new theta input, afferent input is properly received (at least if presented in an encoding phase) with typical AMPA timing characteristics ($\tau_{rise}=1$ ms and $\tau_{fall}=2$ ms), excitatory AMPA reversal potential $E=0$ mV. Afferent spikes are achieved and multiple spikes or bursts avoided with synaptic conductance amplitudes between $G=40$ nS and $G=100$ nS.

Pyramidal AHP implementation: It is not quite clear yet, how strong the AHP should be. The AHP should probably avoid multiple spikes, therefore it should have a rapid onset, $\tau_{rise}=0.0001$ ms, immediately following the absolute refractory period of 2 ms. It should probably also inhibit multiple spiking within a theta period, unless achieved with the aid of the recurrent excitatory activity of an overlapping memory pattern. For this reason, it's fall time is currently relatively long, $\tau_{fall}$ between 5 ms and 10 ms. If its reversal potential is similar to that of inhibitory input, it can be set to $E=-90$ mV. With these values, the conductance can be between $G=200$ nS and $G=800$ nS.

ADP implementation: The ADP is properly achieved with $E=-10$ mV, $G=10$ nS, $\tau_{rise}=800$ ms, $\tau_{fall}=1$ ms and a duration of 125 ms. The conservative setting of the duration parameter avoids superposition of ADP onto subsequent theta cycles, which is probably physiologically plausible. The ADP parameters may need to be adjusted if physiological reasons warrant it.

Operation of STM buffer based only on ADP: With the settings above, repetition of afferent input during the retrieval portion of each theta cycle is achieved. Different items do still merge after about three theta cycles of separate coexistance, which may be sufficient for the purposes of this model. So that the implementation of the STM buffer model is satisfactory, the recurrent GABAergic inhibition is attached for the generation of gamma cycles.

Interneuronal network for gamma oscillations: The interneuronal network node is currently set up with an AMPA input ($\tau_{rise}=1$ ms and $\tau_{fall}=2$ ms) with conductance $G=30$ nS, which causes the node to spike (the conductance is twice the leak conductance of the interneurons with $r=35$ micron and $\tau_{int}=10$ ms). The AMPA input is not too strong, so that multiple gamma spikes can be avoided through the interneuronal AHP. That AHP is set to $E=-90$ mV, $G=200$ nS, $\tau_{rise}=0.0001$ ms and $\tau_{fall}=7$ ms, with a duration of 20 ms (so that the AHP will not be superimposed on a subsequent gamma period). These parameters may need to be adjusted if the gamma cycles do not cooperate satisfactorily with the theta cycles.

Dynamic requirements: There is an issue concerning the desired strength and dynamics of gamma contributions and the time constant of the pyramidal cell membrance capacitance in the STM buffer. The current setting of $\tau_{pyr}=50$ ms limits the rapidity of response to gamma dynamics. That implies that gamma responses can be only either of small amplitude with a small period (the desired 20 to 25 ms) or of large amplitude (as may be needed for adequate separation of gamma periods) with a greater period (which allows too few gamma periods to exist in the positive portion of a theta cycle). For this reason, it may be necessary to speed up the pyramidal dynamics.

Updated Pyramidal dynamics: A basis for an updated pyramidal cell may be physiological characteristics (such as those at http://neuron.princeton.edu/~mol437/references/brain.ps). A human pyramidal cell can have a typical radius of about 20 micon. Leak conductance can be estimated (an example is at http://www.cnbc.cmu.edu/~bard/synapse/node1.html), e.g. with $0.1 mS / cm^2$. Here, the leak conductance is kept at approximately its former value (to minimize necessary adjustments in other parameters), $G_{leak}=6.1$, by setting the radius to the plausible value of 22 micon and the time constant to $\tau_{pyr}=10$ ms. With these settings and a conductance from the interneuronal network to each pyramidal cell of $G=50$ nS, the GABAergic in hibition in gamma cycles can achieve a hyperpolarization of about 10mV in a resting pyramidal cell, which returns to $V_{rest}$ within the gamma period. (Similar useful pyramidal parameter settings are $r=22$ micron and $\tau_{pyr}=8$ ms, which lead to $G_{leak}=6.28$ nS.)

Parameters adjusted for new dynamics: An additional benefit of the faster pyramidal dynamics is the ability to more precisely control the theta response. It is fairly easy to specify a synapse fall timing that allows potentials to return to the resting potential, so that a K+ modified resting potential of -60 mV can be specified. The flatness of the positive portion of theta (which can be a proportionate majority of the theta period) can be adjusted between very flat (for theta timing characteristics $\tau_{fall}=10$ ms) to gently sloped ($\tau_{fall}=14$ ms) or more steeply sloped to compress consecutive spikes ($\tau_{fall}=20$ ms). Using the gentle slope, the desired theta amplitude of 20 mV is achieved with a conductance $G=22$ nS. Single afferent spikes are maintained with an afferent conductance modified to between $G=15$ nS and $G=20$ nS, although a much broader range is acceptable if absolute avoidance of double spikes is not a great priority. The AHP was adjusted to $\tau_{fall}=15$ ms (duration $100$ ms so that it is computed for the entire theta period following a spike) with a conductance between $G=50$ nS and $G=400$ nS, although a broader range of parameter values may also be applicable if such becomes a functional requirements of the AHP. The ADP conductance was modified between $G=6$ nS and $G=10$ nS (for maximal usage of the positive portion of theta, $G=12$ nS already causes some items to fire twice in a theta cycle, once at the very beginning of the positive portion and once at its end).

Gamma cycles: The interneuronal output is connected to the pyramidal cells of the STM buffer via synapses with $E=-70$ mV, $\tau_{rise}=1$ ms, $\tau_{fall}=5$ ms and a conductance $G=20$ nS. The STM buffer maintains two items in the order of their presentation for about 500 ms (four theta cycles). The items decay and are consequently lost from the STM buffer in their order of presentation. Each item spikes 7-8 times. By increasing the ADP duration from 125 ms to 150 ms the decay problem was solved. Apparently, the continual shifting to earlier onset times during the positive portion of theta caused the decay - the ADP did not span the necessary duration to achieve spiking in a subsequent theta period. The STM buffer now maintains two items in their proper order in consecutive gamma cycles of each theta cycle indefinitely!

The STM buffer can also hold three items in their proper order for a limited amount of time, but needs some further adjustment so that it can do so indefinitely.

STM buffer input: Information arriving at the STM buffer should be filtered, so that approximately one new afferent item is presented per theta cycle. Both ECIIinput and the STM buffer in EC receive input directly from place activity (generated by a FeatureDiscretizer concatenated with a SpikeSplitter object). The place activity is regular (at 50 Hz). The ECIIinput population will be adjusted so that a single spike occurs for each burst of place activity, while the STM buffer should spike at the retrieval phase of each theta cycle. The afferent place input should be presented during the encoding phase of the theta cycle, with a sufficient AHP. If repeated presentation threatens to flush the buffer before subsequent places can be associated, then presentation should not occur on each encoding phase, but only on those where novel inputs are received (a new location is entered). This remains to be seen, but can be accomplished with a feedback loop that modifies presentation strength according to novelty.

In its current condition, the STM buffer may already achieve its goals with regard to the needs of the current model, even without further filtering of the input. The AHP already appears to be strong enough to limit spiking to one spike per item per theta cycle. Neither the maintenance of order, the effect of multiple presentations of the same place input, or the timing of afferent input can be guaranteed at this time, it does appear that successive place input is repeated in sufficient temporal proximity to allow LTP to occur. This is adequate as long as bidirectional connections are permitted in path learning.

Immediate STM buffer objectives:

Future STM buffer objectives::

Issues with the ECII input population: (a) Why do all cells appear to spike once at t=0 ms? (b) Cells should spike only in the presence of sufficient place input and ECIII input. Issue (b) is taken care of by modifying the conductance of the input synapses by the factor 100 that reflects that change in units. With $G=0.05$ nS, the proper behaviour is achieved, except that ECIII output may still be incorrect.

Issues with ECIII output: The first task is to get ECIII to follow the information coming from the STM buffer in ECII. This is also an opportunity to take some of the parameter variations out of the model and to standardize on some concrete types of neurons. The ECIII neurons are made to correspond with the model of entorhinal pyramidal cells used in the STM buffer, with the aim that their behaviour is a predefined response to theta modulation and gamma cycles. The clamp is removed, so that the more biologically plausible synaptically induced theta can be added when desired. The AHP is modified to resemble that used in the ECII STM buffer. The theta and gamma synapses correspond to those in the STM buffer, while the cell is recalibrated ($V_{rest}=V_{reset}=-60$ mV, $r=22$ micron, $\tau_{pyr}=10$ ms) so that theta can modulate membrane potential around an average of -70 mV. LTP is temporarily deactivated by setting $G_{max}=G$. This can help to achieve the desired behaviour induced by input from the STM buffer. That input is given a conductance $G=15$ nS. This achieved ECIII spiking that follows ECII STM buffer spiking perfectly.

ECIII learning and ECII input buffer behaviour: When LTP is activated by setting $G_{max}=15$ nS in ECIII, autoassociative learning does occur as expected. Unfortunately, item repetitions in the ECII STM buffer are as yet continuous, so that eventually all cells become associated. The ECIIinput population is corrected by adding a seperate synapse for ECIII input, with stronger conductance $G=0.15$ nS. This produced the desired ECII input population behaviour, with a single spike where place input and ECIII input occur together.

STM buffer slow AHP: To limit the duration of STM repetition, a functional slow AHP is added. Currently, a simple way to add this is through an additional synapse in the STM buffer ($E=-70$ mV, $G=0.5$ nS, $\tau_{rise}=800$ ms, $\tau_{fall}=10$ ms, duration=1200 ms) and a network pathway with one-to-one connections from the buffer output to that synapse. This implementation is used, since response functions (such as the fast AHP and ADP responses) are by definition reset at each spike, while the function governing the shut-off of item repetition must act over multiple spikes. The network pathway connection object cannot (currently) be captured within a neuron object, hence it is connected externally (delay=1 ms). By setting $G=0$ on the slow AHP synapse, the buffer can operate in the indefinitely repetitive mode. The chosen conductance allows an item to be repeated for 7 to 9 theta cycles, approximately 800 to 1000 ms. Some precision issues remain with the STM buffer, in addition to the points noted above: (1) As the first two place inputs appear, they are not kept in their order of presentation. (2) Some new input appears to spike more than three times during the first theta cycle (the first spike may actually appear at the very end of the previous theta cycle). (3) As the virtual rat traverses the path backward, STM buffer cells in those locations are still suppressed to some degree by the slow AHP (perhaps the slow AHP should be turned off completely once repetition has stopped). The slow AHP should probably also turn off repetition more rapidly, after 3 theta cycles for instance.

Sequence Learning: The STM buffer needs to be adjusted to solve (3) above - a transmission modulation in the afferents may suffice. Another interesting modification may be to give afferents slow dynamics, but have them be subthreshold during the encoding phase. That way, the STM buffer would not spike for afferent input, so neither would ECIII. Preferably, the order should also be retained in the STM buffer (with adequate flushing of older items). Learning in ECIII and CA3 should be primarily heteroassociative (changing the onset of the LTP function may help, see papers about LTP including the author Poo from 1998), and should convey the sequence order by binding only consecutive items. Since a ``Cumulative Response'' object is now available in Catacomb, I am replacing the improvised slow AHP circuitry with that. The cumulative slow AHP is tested with the parameters $E=-70$ mV, $G=1$ nS, $\tau_{rise}=400$ ms, $\tau_{fall}=10$ ms and duration=400 ms. Some adjustements will be necessary, once the afferent input has been regulated, to produce the desired effect in the STM buffer.

Sequence Learning - cleaning up STM: I am adjusting the slow AHP and adding transmission modulation to deal with point (3) above. While it is desirable that the transmission modulation on recurrent collateral synapses follows the membrane potential modulation caused by theta oscillations, the transmission modulation on afferent input should allow afferent spiking in only a very restricted portion of the theta cycle. To achieve this, the GABA_B receptor modulation of afferent terminal transmission was rescaled so that a broader period of suppression resulted. By combining such transmission modulation with a slow afferent synaptic response, arbitrary spike arrival times can be modulated to approach desired synchronization. This is a model for the stages leading from arbitrary sensory spike timing to synchronous processing in theta cycles. Synchronicity can be refined as spikes propagate through multiple layers of excitatory cells with theta modulated membrane potentials and afferent synaptic transmission. (Of course this can also be done using consecutive layers with transmission modulation in phase with membrane potential modulation.) This approach appears to be able to solve point (2) above as well. (Note that the rescaling is currently off, as the quality of the spiking on the first cell of the STM buffer appeared to be almost the same with or without rescaling.) Lowering of the afferent conductance to 10 nS resulted in a cleaner sequence memory for spikes on the first to place cells, but spiking still occurs during the encoding period and the first spike of the second place field still falls on the end of a retrieval phase of a theta period. The desired transmission modulation function for afferents to the STM buffer is achieved by reversing the rescaling, so that the function is a mirror image of the membrane theta modulation. But is there a plausible physiological way for such a function to appear at the terminals of afferent fibres? And... why do the second place cell's spikes no longer achieve STM buffer spiking?

Goal Activity: In order to achieve the effect of goal activity in ECIII, three things must be verified: (1) Heteroassociative pairs must be maintained in the STM-buffer. (2) A strengthening of synaptic efficacies must take place between consecutively active cells in an episode in ECIII. (3) The goal location must activate and periodically reactivate in PFC as a result of the combination of place activity and the discovery of the goal. Then the activity of the goal can retrieve episodes in ECIII through backwards spread. The PFC population is implemented in a similar manner as the STM-buffer population, but without interneuronal network input to create gamma cycles and without a slow AHP, so that repetition is indefinite. Two synaptic populations are provided for afferent input, one from the goal detection signal and the other from place input. The conductance from place input is set to $G=3$ nS, so that activity remains just below threshold in the absence of a goal detection signal. The conductance of the proximity signal (from the ingester component) was set to $G=2$ nS, so that its response also remains below theshold in the absence of place input. Where the proximity signal and place input combine, spiking is induced. That spiking is repeated indefinitely, once on each theta cycle. By increasing the conductance on afferents from the STM-buffer to ECIII to $G=25$ nS, all spikes in the STM-buffer are properly conveyed. It is possible to set the conductance of the STM-buffer's slow-AHP to $G=1.1$ nS, so that the first and second place field spikes co-occur closely for an extended period of time. A better solution would be to have the first place field spike occur earlier in its second theta cycle, so that the second one can spike in the proper order. This better solution is initiated by raiding the ADP conductance in the STM-buffer from $G=8$ nS to $G=10$ nS. This is still within the range advised above (between $G=6$ nS and $G=10$ nS). I then increased the slow-AHP conductance to $G=1.8$ nS, causing only 4 cycles of the first and second place cell firing to occur, in their proper order. (On occasion, some temporary convergence may occur as a place cell firing diminishes due to slow-AHP, although this was not the case for the first and second place cell firing here.) The PFC output was then connected to the same synaptic population on ECIII as the STM-buffer output.

Retrieval versus Exploration: In future experiments, the removal of the goal should cause the virtual rat to initiate exploratory behaviour, instead of continuing to recall learned episodes. This is implemented through circuitry that links both network output and random spikes that direct the virtual rat to a place field into the Feature De-Discretizer (the reverse I/O through the FeatureDiscretizer). The random events are suppressed by goal activity. This is done by feeding the output of PFC into an interneuron labelled "exploration-inhibition", via a strong synapse. That interneuron drives a strong GABA-ergic input with rapid rise time ($\tau_{rise}=0.1$ ms) and slow fall time ($\tau_{fall}=100$ ms and duration=125 ms) to a population of neurons that receive the random spiking events.

Necessary Operational Improvements: The following list of improvements is deemed necessary, after a review of the network performance, as depicted in graphs on the Ratlab Model Images page.

Operational Issues 20020131:

  1. Why are some cells deactivating slower, despite having the same sAHP?
    Much of this appears to be a result of the constant application of a uniform sAHP, despite different place field traversal durations. This may be dealt with by either (a) arranging the durations such that the problem does not occur, (b) implemented a method whereby the third item replaces and deactivates the first item in the STM buffer, or (c) an externally applied inhibition that commences as place cell firing ceases.

    An optimal STM buffer for human memory acquisition has the following properties:

    1. In the design of an STM buffer of size N items, you clearly don't want item 1 to hang around when item N+1 arrives.
    2. The STM buffer would also be most useful if it can be used equally well for a range of item arrival speeds. Not all ranges are desirable, e.g. intervals of days require other integrative processes, such as reactivation of LTM in SWS. That is not remembered as an ``episode'', although integration can still deduce potential cause-effect relationships. Reasonable ranges are from 10s of milliseconds to 10s of seconds, and perhaps somewhat narrower than that.
    3. Preferably, you would like to retain items in STM until the next item in the sequence appears... within reason, it is acceptable that STM decays over several seconds. This implies that slow AHP and the effects of noise should result in decay over several seconds. There may also be an effect of habituation.
    Options for the implementation of mechanisms for the necessary behaviour are:
    1. The appearance of item N+1 at the inputs immediately suppresses item 1 in STM.
    2. The appearance of items 2 and on at the inputs begins a gradual inhibition (through competition) of item 1 in STM.
    3. The addition of item 2 and on in STM competes with item 1, eventually causing it to deactivate.
    4. The addition of item N+1 to the STM buffer pushes item 1 out of STM. According to Jensen and Lisman, the drop-out of the first item when an item N+1 is added to an STM buffer with capacity N, is due to the decline of ADP just after a subsequent theta cycle.

      This should be achievable if the ADP can be made to drop off rapidly after a specific amount of time, although the cell's own capacitance will limit that rapidity. It is not desirable to fiddle much with the shape of the ADP as it currently is, since its relative steepness is very useful for reliably distinct STM spikes. If the buffer is full, the first item will be relatively close to the afferent input. The afferent input can cause something like a gamma cycle to delay the first item. The time between the spiking of the first item in successive theta cycles was already stabilized to about a theta period, so that an additional delay could bring it beyond the effective range of the ADP. This approach relies on several potentially implausible or difficult components to add: (i) rapid drop off of ADP after a duration equal to a theta period, (ii) an inhibition caused by the afferent spike that does not undo the reliability achieved by adding transmission modulation to the recurrent GABA-ergic input.

      Perhaps the ADP rise time can be shortened further if the conductance is increased, thereby increasing the STM buffer capacity further, without losing robustness. The ADP can be stopped by setting the duration to about 130 ms. That may seem unrealistic, but really just involves the production of a different response function (not a double-exponential shape). That could be possible if a different intrinsic process begins to operate after the rise time of the ADP. When relying on gamma cycles to cause the suppression that would lead to the dropping of the first item, the parameter selection is extremely difficult and fragile. This does not look like a plausibly robust implementation.

    I'm putting in a temporary cludge that is not biologically plausible, but will allow me to mull over the STM issue, while fixing the rest of the network! The previous version of the model, with slow AHP is retained in the copy newtmaze.STM-slowAHP.ccm.

    Two other possibilities that may offer more plausible ways to replace STM items are:

    This issue has been dealt with for neuronal circuitry that implements oscillatory STM with FIFO properties.

    The slow AHP, now responsible only for gradual deactivation without replacement by new items that is also caused by decoherence through noise, is set to $\tau_{rise}=3000$ ms, $\tau_{fall}=400$ ms, duration=4000 ms and conductance $G=0.05$ nS. These values can be easily modified if a retention greater than three seconds is necessary.

    An interesting fact is that a slight increase of the ADP steepness ($\tau_{rise}=290$ ms) to insure that the spiking of the third item is not omitted just prior to the introduction of the fourth item, causes the fourth item to replace the first on its second theta cycle, but also causes it to take its place, out of order.

    The main problem with all counter or integrator methods that deactivate the first item is that the new item cannot spike in the right order, because items 2 and on are still in their original slots within the theta period. For this reason, it might be better that whichever method is chosen should commence inhibiting as soon as the 3rd item enters the STM buffer. The inhibition can then be gradual, such as a slow AHP. The first item can then be deactivated a few theta cycles before the next afferent item appears, allowing all items to move into their new positions before the new item appears.

    A third synapse population is added to the STM-shutoff cell population, which provides an offset that brings only cells that are spiking in the STM buffer closer to the threshold. With the parameters in the models newtmaze.inhibition-at-4th-item.ccm and newtmaze.inhibition-at-3rd-item.ccm, the first item is shut off at 4th and 3rd item presentations respectively. While neither model includes gradual shut-off, shut-off at the 3rd item presentation demonstrates that the method works when there is sufficient space for the new item to be retrieved in the same theta period. Obviously, a more plausible implementation of the method, that enables 3 item STM must eventually replace the current implementation. The power of the ADP needed to be increased ever so slightly ($\tau_{rise}=285$ ms and $G=3.2$ nS) for the 2 item STM version with shut-off at the 3rd item presentation, so that each new item would be retrieved within the theta period in which it was presented. This adjustment may not be necessary or desirable in other versions and future better implementations.

  2. Why does cell 3 activity only barely get close to cell 4 activity once?
    This is also solved by the solution to the point above.
  3. Make sure that ECIII is receiving the patterns from STM properly.
    The patterns are arriving from both STM and PFC and causing ECIII to spike accordingly. As only two items are currently being held in STM at one time, and gamma cycle compression is not strong, there is a span of 50 ms between the two consecutive items in STM. This span must be bridged by the LTP function. How to insure that learned episodes do not interfere with ongoing learning, e.g. recall of stem-left trajectory while exploring the stem-right trajectory?

    Problems of retrieval and encoding in the same theta periods:

    If cells PC1 and PC2 spike in consecutive gamma cycles of STM and their activity is strongly transferred to ECIII, the LTP window will allow the synaptic strength from PC1 to P2 to be increased.

    ECIII retrieval can be: (a) A response to the activity during the encoding period, or (b) an entirely separate sequence of afferent input followed by responses during high theta. Option (b) can work well to retrieve only in response to goal location activity and subsequent recurrent recall. Retrieval during one theta period can be very fast (long episodes), slower (in gamma cycles, as mentioned above), or even just one per theta period, depending on synaptic delays. The fast choice may be the most efficient one for the backwards spread hypothesis, but may not be justified by empirical recordings, unless the gamma frequencies recorded there are evidence only of oscillatory activity in the encoding phase of ECIII.
  4. Assure that the LTP in ECIII is designed such that it captures all the heteroassociative pairs, add plasticity modulation.

  5. Assure that CA3 also manages that learning.