The small space between the sending neuron and the receiving neuron is the

If you're seeing this message, it means we're having trouble loading external resources on our website.

Table of Contents Show

Graphical abstract
Introduction
Growth of neuronal “wetware” for computation
Neural cells show well-characterized spontaneous action potentials that develop over time
Building a modular, real-time platform to harness neuronal computation
Increasing the density of sensory information input leads to increased performance
BNNs learn over time when embodied in a gameplay environment
Learning effects in BNNs are observed across additional measures
BNNs require feedback for learning
Dynamics in electrophysiological activity display coherent connectivity
Electrophysiological activity is linked with higher average rally length
What refers to tiny spaces between neurons?
Where is the synaptic gap?
What is the small gap between neurons quizlet?
What is the gap between neurons called quizlet?

If you're behind a web filter, please make sure that the domains *.kastatic.org and *.kasandbox.org are unblocked.

Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, London, UK

Search for articles by this author

Ben Rollo

Affiliations

Department of Neuroscience, Central Clinical School, Monash University, Melbourne, Australia

Search for articles by this author

Adeel Razi

Affiliations

Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, London, UKTurner Institute for Brain and Mental Health, Monash University, Clayton, VIC, AustraliaMonash Biomedical Imaging, Monash University, Clayton, VIC, AustraliaCIFAR Azrieli Global Scholars Program, CIFAR, Toronto, Canada

Search for articles by this author

Karl J. Friston

Affiliations

Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, London, UK

Search for articles by this author

Author Footnotes

11 Lead contact

Open AccessPublished:October 12, 2022DOI:https://doi.org/10.1016/j.neuron.2022.09.001

In vitro neurons learn and exhibit sentience when embodied in a simulated game-world

Previous ArticleMyeloid cell interferon secretion restricts Zika flavivirus infection of developing and malignant human neural progenitor cells

Next ArticleLong-range functional loops in the mouse olfactory system and their roles in computing odor identity

Highlights

•
Improvements in performance or “learning” over time following closed-loop feedback
•
Learning observed from both human and primary mouse cortical neurons
•
Systems with stimulus but no feedback show no learning
•
Dynamic changes observed in neural electrophysiological activity during embodiment

Summary

Integrating neurons into digital systems may enable performance infeasible with silicon alone. Here, we develop DishBrain, a system that harnesses the inherent adaptive computation of neurons in a structured environment. In vitro neural networks from human or rodent origins are integrated with in silico computing via a high-density multielectrode array. Through electrophysiological stimulation and recording, cultures are embedded in a simulated game-world, mimicking the arcade game “Pong.” Applying implications from the theory of active inference via the free energy principle, we find apparent learning within five minutes of real-time gameplay not observed in control conditions. Further experiments demonstrate the importance of closed-loop structured feedback in eliciting learning over time. Cultures display the ability to self-organize activity in a goal-directed manner in response to sparse sensory information about the consequences of their actions, which we term synthetic biological intelligence. Future applications may provide further insights into the cellular correlates of intelligence.

Graphical abstract

Graphical Abstract

View Large Image
Download Hi-res image
Download (PPT)

Keywords

cell culture
microphysiological systems
learning
intelligence
electrophysiology
neurocomputation
synthetic biological intelligence
free energy principle
in vitro
neurons

Introduction

Harnessing the computational power of living neurons to create synthetic biological intelligence (SBI), previously confined to the realm of science fiction, may now be within reach of human innovation. The superiority of biological computation has been widely theorized with attempts to develop biomimetic hardware supporting neuromorphic computing (

Kumar S.
Williams R.S.
Wang Z.

Third-order nanocircuit elements for neuromorphic engineering.

Nature. 2020; 585: 518-523https://doi.org/10.1038/s41586-020-2735-5

Crossref
PubMed
Scopus (95)
Google Scholar

). Yet no artificial system outside biological neurons is capable of supporting at least third-order complexity (able to represent three state variables), which is necessary to recreate the complexity of a biological neuronal network (BNN) (

Izhikevich E.M.

Dynamical Systems in Neuroscience: The Geometry of Excitability and Bursting.

The MIT Press, 2006https://doi.org/10.7551/mitpress/2526.001.0001

Crossref
Google Scholar

;

Kumar S.
Williams R.S.
Wang Z.

Third-order nanocircuit elements for neuromorphic engineering.

Nature. 2020; 585: 518-523https://doi.org/10.1038/s41586-020-2735-5

Crossref
PubMed
Scopus (95)
Google Scholar

). While significant progress has been made in mapping in vivo neural computation, there are technical limits to exploring this in vitro (

Barron H.C.
Reeve H.M.
Koolschijn R.S.
Perestenko P.V.
Shpektor A.
Nili H.
Rothaermel R.
Campo-Urriza N.
O’Reilly J.X.
Bannerman D.M.
et al.

Neuronal Computation Underlying Inferential Reasoning in Humans and Mice.

Cell. 2020; 183: 228-243.e21https://doi.org/10.1016/j.cell.2020.08.035

Abstract
Full Text
Full Text PDF
PubMed
Scopus (36)
Google Scholar

). Here, we aim to establish functional in vitro BNNs from embryonic rodent and human-induced pluripotent stem cells (hiPSCs) on high-density multielectrode arrays (HD-MEAs) to demonstrate that these neural cultures can exhibit biological intelligence—as evidenced by learning in a simulated gameplay environment to alter activity in an otherwise arbitrary manner—in real time (). It is proposed that these neural cultures would meet the formal definition of sentience as being “responsive to sensory impressions” through adaptive internal processes (

Friston K.J.
Wiese W.
Hobson J.A.

Sentience and the Origins of Consciousness: From Cartesian Duality to Markovian Monism.

Entropy. 2020; 22: 516https://doi.org/10.3390/e22050516

Crossref
Google Scholar

). Instantiating SBIs could herald a paradigm shift of research into biological intelligence, including pseudo-cognitive responses as part of drug screening (

Kagan B.J.
Duc D.
Stevens I.
Gilbert F.

Neurons Embodied in a Virtual World: Evidence for Organoid Ethics?.

AJOB Neurosci. 2022; 13: 114-117https://doi.org/10.1080/21507740.2022.2048731

Crossref
PubMed
Scopus (1)
Google Scholar

;

Myers D.
Goldberg A.M.
Poth A.
Wolf M.F.
Carraway J.
McKim J.
Coleman K.P.
Hutchinson R.
Brown R.
Krug H.F.
et al.

From in vivo to in vitro: The medical device testing paradigm shift.

ALTEX. 2017; 34: 479-500https://doi.org/10.14573/altex.1608081

Crossref
PubMed
Scopus (25)
Google Scholar

), bridging the divide between single-cell and population-coding approaches to understanding neurobiology (

Ebitz R.B.
Hayden B.Y.

The population doctrine in cognitive neuroscience.

Neuron. 2021; 109: 3055-3068https://doi.org/10.1016/j.neuron.2021.07.011

Abstract
Full Text
Full Text PDF
PubMed
Scopus (21)
Google Scholar

), exploring how BNNs compute to inform machine-learning approaches (

Mattar M.G.
Lengyel M.

Planning in the brain.

Neuron. 2022; 110: 914-934https://doi.org/10.1016/j.neuron.2021.12.018

Abstract
Full Text
Full Text PDF
PubMed
Scopus (6)
Google Scholar

), and potentially giving rise to silico-biological computational platforms that surpass the performance of existing purely silicon hardware. Theoretically, generalized SBI may arrive before artificial general intelligence (AGI) due to the inherent efficiency and evolutionary advantage of biological systems (

Buchanan M.

Organoids of intelligence.

Nat. Phys. 2018; 14: 634https://doi.org/10.1038/s41567-018-0200-2

Crossref
Scopus (4)
Google Scholar

Figure 1DishBrain system and experimental protocol schematic

Neuronal cultures derived from hiPSC via DSI protocol, NGN2 lentivirus-directed differentiation, or primary cortical cells from E15.5 mouse embryos were plated onto HD-MEA chips and embedded in a stimulated game-world of “Pong” via the DishBrain system. Different DishBrain environments were created by altering the pattern of sensory information (yellow bolts), feedback (colored bolts), or no stimulus (red crosses) to demonstrate (1 and 2) low-latency, closed-loop feedback system (stimulation (STIM) and silent (SIL) treatment); (3) no-feedback (NF) system to demonstrate an open-loop feedback configuration; and (4) rest (RST) configuration to demonstrate a system in which sensory information is absent. Interactive visualizer of activity and gameplay: https://bit.ly/3DSi4Eg.

View Large Image
Download Hi-res image
Download (PPT)

This system, termed DishBrain, can leverage the inherent property of neurons to share a “language” of electrical activity to link silicon and BNN systems through electrophysiological stimulation and recording. Given the compatibility of hardware and cells (wetware), it is necessary to investigate what processes would result in intelligent (goal-directed) behavior when BNNs are embodied through a closed-loop system. Two interrelated processes are required for sentient behavior in an intelligent system. Firstly, the system must learn how external states influence internal states via perception and how internal states influence external states via action. Secondly, the system must infer from its sensory states when it should adopt a particular activity and how its actions will influence the environment. To address the first imperative, custom software drivers were developed to create low-latency closed-loop feedback systems that simulated exchange with an environment for BNNs through electrical stimulation. Closed-loop systems afford an in vitro culture “embodiment” by providing feedback on the causal effect of the behavior from the cell culture. Embodiment requires a separation of internal versus external states where feedback of the effect of an action on a given environment is available. Previous works, both in vitro and in silico, have shown that electrophysiological closed-loop feedback systems engender significant network plasticity (

Bakkum D.J.
Chao Z.C.
Potter S.M.

Spatio-temporal electrical stimuli shape behavior of an embodied cortical network in a goal-directed learning task.

J. Neural. Eng. 2008; 5: 310-323https://doi.org/10.1088/1741-2560/5/3/004

Crossref
PubMed
Scopus (80)
Google Scholar

;

Chao Z.C.
Bakkum D.J.
Potter S.M.

Shaping Embodied Neural Networks for Adaptive Goal-directed Behavior.

PLoS Comput. Biol. 2008; 4: e1000042https://doi.org/10.1371/journal.pcbi.1000042

Crossref
PubMed
Scopus (50)
Google Scholar

). Further support is found in vivo by disrupting the closed-loop coupling between visual feedback and motor outputs in the primary visual cortex of mice (

Attinger A.
Wang B.
Keller G.B.

Visuomotor Coupling Shapes the Functional Development of Mouse Visual Cortex.

Cell. 2017; 169: 1291-1302.e14https://doi.org/10.1016/j.cell.2017.05.023

Abstract
Full Text
Full Text PDF
PubMed
Scopus (70)
Google Scholar

), highlighting the link between feedback and the development of functional behavior in BNNs.

To address the second requirement, a theoretical framework for how intelligent behavior may arise was tested by the DishBrain system. One proposition for how intelligent behavior may arise in an intelligent system embodied in an environment is the theory of active inference via the free energy principle (FEP) (

Friston K.
Breakspear M.
Deco G.

Perception and self-organized instability.

Front. Comput. Neurosci. 2012; 6: 44https://doi.org/10.3389/fncom.2012.00044

Crossref
PubMed
Scopus (112)
Google Scholar

). The FEP suggests a testable implication that at every spatiotemporal scale, any self-organizing system separate from its environment seeks to minimize its variational free energy (VFE) (

Friston K.

The free-energy principle: a unified brain theory?.

Nat. Rev. Neurosci. 2010; 11: 127-138https://doi.org/10.1038/nrn2787

Crossref
PubMed
Scopus (3313)
Google Scholar

;

Palacios E.R.
Razi A.
Parr T.
Kirchhoff M.
Friston K.

On Markov blankets and hierarchical self-organization.

J. Theor. Biol. 2020; 486: 110089https://doi.org/10.1016/j.jtbi.2019.110089

Crossref
PubMed
Scopus (38)
Google Scholar

;

Parr T.
Friston K.J.

Generalised free energy and active inference.

Biol. Cybern. 2019; 113: 495-513https://doi.org/10.1007/s00422-019-00805-w

Crossref
PubMed
Scopus (103)
Google Scholar

). The gap between the model predictions and observed sensations (“surprise” or “prediction error”) may be minimized in two ways: by optimizing probabilistic beliefs about the environment to make predictions more like sensations or by acting upon the environment to make sensations conform to its predictions. This model then implies a common objective function for action and perception that scores the fit between an internal model and the external environment. Under this theory, BNNs hold “beliefs” about the state of the world, where learning involves updating these beliefs to minimize their VFE or actively change the world to make it less surprising (

Parr T.
Friston K.J.

The Discrete and Continuous Brain: From Decisions to Movement—And Back Again.

Neural Comput. 2018; 30: 2319-2347https://doi.org/10.1162/neco_a_01102

Crossref
PubMed
Scopus (28)
Google Scholar

Parr T.
Friston K.J.

Generalised free energy and active inference.

Biol. Cybern. 2019; 113: 495-513https://doi.org/10.1007/s00422-019-00805-w

Crossref
PubMed
Scopus (103)
Google Scholar

). If true, this implies that it should be possible to shape BNN behavior by simply presenting unpredictable feedback following “incorrect” behavior. Theoretically, BNNs should adopt actions that avoid the states that result in unpredictable input. By developing a system that allows for neural cultures to be embodied in a simulated game-world, we are not only able to test whether these cells are capable of engaging in goal-directed learning in a dynamic environment, but we are also able to investigate the foundations of intelligence.

Previous work supports that in vitro neuronal networks can perform blind-source separation in an open-loop environment via state-dependent Hebbian plasticity consistent with the FEP (

Isomura T.
Kotani K.
Jimbo Y.

Cultured Cortical Neurons Can Perform Blind Source Separation According to the Free-Energy Principle.

PLoS Comput. Biol. 2015; 11: e1004643https://doi.org/10.1371/journal.pcbi.1004643

Crossref
PubMed
Scopus (22)
Google Scholar

;

Isomura T.
Friston K.

In vitro neural networks minimise variational free energy.

Sci. Rep. 2018; 8: 16926https://doi.org/10.1038/s41598-018-35221-w

Crossref
PubMed
Scopus (18)
Google Scholar

). We sought to build upon this work to test the theory of active inference, which applies the FEP to sentient systems that not only adapt to fit their environment, but also act upon their environment to fit it to themselves. We therefore hypothesize that when provided a structured external stimulation simulating the classic arcade game “Pong” within the DishBrain system, the BNN would modify internal activity to avoid adopting states linked to unpredictable external stimulation. This minimization of input unpredictability would manifest as the goal-directed control of the simulated “paddle” in this simplified simulated “Pong” environment.

Results

Growth of neuronal “wetware” for computation

Cortical cells from the dissected cortices of rodent embryos can be grown on MEAs in nutrient-rich media and maintained for months (

Bardy C.
van den Hurk M.
Eames T.
Marchand C.
Hernandez R.V.
Kellogg M.
Gorris M.
Galet B.
Palomares V.
Brown J.
et al.

Neuronal medium that supports basic synaptic functions and activity of human neurons in vitro.

Proc. Natl. Acad. Sci. USA. 2015; 112: E2725-E2734https://doi.org/10.1073/pnas.1504393112

Crossref
PubMed
Scopus (200)
Google Scholar

;

Lossi L.
Merighi A.

The Use of ex Vivo Rodent Platforms in Neuroscience Translational Research With Attention to the 3Rs Philosophy.

Front. Vet. Sci. 2018; 5: 164https://doi.org/10.3389/fvets.2018.00164

Crossref
PubMed
Scopus (17)
Google Scholar

). These cultures will develop complicated morphology with numerous dendritic and axonal connections, leading to functional BNNs (

Kamioka H.
Maeda E.
Jimbo Y.
Robinson H.P.
Kawana A.

Spontaneous periodic synchronized bursting during formation of mature patterns of connections in cortical cultures.

Neurosci. Lett. 1996; 206: 109-112https://doi.org/10.1016/S0304-3940(96)12448-4

Crossref
PubMed
Scopus (287)
Google Scholar

;

Wagenaar D.A.
Pine J.
Potter S.M.

An extremely rich repertoire of bursting patterns during the development of cortical cultures.

BMC Neurosci. 2006; 7: 11https://doi.org/10.1186/1471-2202-7-11

Crossref
PubMed
Scopus (544)
Google Scholar

). Primary neural cultures from embryonic day 15.5 (E15.5) mouse embryos were cultured, with representative cultures shown in A . HiPSCs were differentiated into monolayers of active heterogeneous cortical neurons, which have been shown to display mature functional properties (

Denham M.
Parish C.L.
Leaw B.
Wright J.
Reid C. a.
Petrou S.
Dottori M.
Thompson L.H.

Neurons derived from human embryonic stem cells extend long-distance axonal projections through growth along host white matter tracts after intra-cerebral transplantation.

Front. Cell. Neurosci. 2012; 6: 11https://doi.org/10.3389/fncel.2012.00011

Crossref
PubMed
Scopus (33)
Google Scholar

;

Denham M.
Dottori M.

Signals involved in neural differentiation of human embryonic stem cells.

Neurosignals. 2009; 17: 234-241https://doi.org/10.1159/000231890

Crossref
PubMed
Scopus (35)
Google Scholar

;

Shi Y.
Kirwan P.
Livesey F.J.

Directed differentiation of human pluripotent stem cells to cerebral cortex neurons and neural networks.

Nat. Protoc. 2012; 7: 1836-1846https://doi.org/10.1038/nprot.2012.116

Crossref
PubMed
Scopus (514)
Google Scholar

). Using dual SMAD inhibition (DSI) (

Denham M.
Parish C.L.
Leaw B.
Wright J.
Reid C. a.
Petrou S.
Dottori M.
Thompson L.H.

Neurons derived from human embryonic stem cells extend long-distance axonal projections through growth along host white matter tracts after intra-cerebral transplantation.

Front. Cell. Neurosci. 2012; 6: 11https://doi.org/10.3389/fncel.2012.00011

Crossref
PubMed
Scopus (33)
Google Scholar

;

Fattahi F.
Studer L.
Tomishima M.J.

Neural Crest Cells from Dual SMAD Inhibition: Neural Crest Cells from Dual SMAD Inhibition.

in: Bhatia M. Elefanty A.G. Fisher S.J. Patient R. Schlaeger T. Snyder E.Y. Current Protocols in Stem Cell Biology. John Wiley & Sons, Inc., 2015https://doi.org/10.1002/9780470151808.sc01h09s33

Google Scholar

), we developed long-term cortical neurons that formed dense connections with supporting glial cells (B and 2C). Finally, we aimed to expand our study using a different method of hiPSC differentiation—NGN2 direct reprogramming (

Pak C.
Pak C.
Grieder S.
Yang N.
Zhang Y.
Wernig M.
Sudhof T.

Rapid generation of functional and homogeneous excitatory human forebrain neurons using Neurogenin-2 (Ngn2).

Protoc. Exch. 2018; https://doi.org/10.1038/protex.2018.082

Crossref
Google Scholar

;

Zhang Y.
Pak C.
Han Y.
Ahlenius H.
Zhang Z.
Chanda S.
Marro S.
Patzke C.
Acuna C.
Covy J.
et al.

Rapid Single-Step Induction of Functional Neurons from Human Pluripotent Stem Cells.

Neuron. 2013; 78: 785-798https://doi.org/10.1016/j.neuron.2013.05.029

Abstract
Full Text
Full Text PDF
PubMed
Scopus (746)
Google Scholar

)—used in our final part of this study investigating feedback mechanisms. This high-yield method resulted in cells displaying pan-neuronal markers (Figures S1A and S1B). These cells typically display a high proportion of excitatory glutamatergic cells, quantified using qPCR, shown in D. Integration of these neuronal cultures on the HD-MEAs was confirmed via scanning electron microscopy (SEM) on cells that had been maintained for >3 months (E). Densely interconnected dendritic networks could be observed in neuronal cultures forming interlaced networks spanning the MEA area (F). These neuronal cultures appeared to rarely follow the topography of the MEA, being more likely to form large clusters of connected cells with dense dendritic networks (G and 2H). This is likely due to the large size of an individual electrode within the MEA and potentially also chemotactic effects that can contribute to counteract the effect of substrate topography on neurite projections (

Mattotti M.
Alvarez Z.
Ortega J.A.
Planell J.A.
Engel E.
Alcántara S.

Inducing functional radial glia-like progenitors from cortical astrocyte cultures using micropatterned PMMA.

Biomaterials. 2012; 33: 1759-1770https://doi.org/10.1016/j.biomaterials.2011.10.086

Crossref
PubMed
Scopus (45)
Google Scholar

Figure 2Cortical cells form dense interconnected networks

(A and B) Cortical cells from E15 mouse brains and differentiated from hiPSCs, respectively. DAPI in blue stains all cells, NeuN in green shows neurons, beta III tubulin (BIII) marks axons, while MAP2 marks dendrites. Scale bar = 50μm.

(C) GFAP shows supporting astrocytes, critical for long-term functioning; TBR1 marks cortex-specific cells. No Ki67, a marker of dividing cells, was observed with these cultures. Scale bar = 50μm.

(D) Gene expression studies over 28 days demonstrated increased expression of the glutamatergic neural marker, vesicular glutamate transporter 1 (vGLUT1).

(E–G) Neurons differentiated from hiPSCs using the DSI protocol, maintained on MEA for >3 months. White arrows show regions of shrinkage within the cultures, red arrows show bundles of axons, and blue arrows show single neurite extensions. Note the dense coverage over the HD-MEA and overlapping connections extended from neuronal soma present in all cultures across multiple electrodes. Scale bars: E = 200μm, F = 100μm, G = 50μm

(H) Has false coloring to highlight the HD-MEA electrodes beneath the cells. Scale bar = 20μm.

View Large Image
Download Hi-res image
Download (PPT)

Neural cells show well-characterized spontaneous action potentials that develop over time

In vitro development of electrophysiological activity in neural systems at high spatial and temporal resolution was mapped. Robust activity in primary cortical cells from E15.5 rodents was found at days in vitro (DIV) 14 (A and 3E ) where bursts of synchronized activity were regularly observed, as previously demonstrated (

Kamioka H.
Maeda E.
Jimbo Y.
Robinson H.P.
Kawana A.

Spontaneous periodic synchronized bursting during formation of mature patterns of connections in cortical cultures.

Neurosci. Lett. 1996; 206: 109-112https://doi.org/10.1016/S0304-3940(96)12448-4

Crossref
PubMed
Scopus (287)
Google Scholar

;

Wagenaar D.A.
Pine J.
Potter S.M.

An extremely rich repertoire of bursting patterns during the development of cortical cultures.

BMC Neurosci. 2006; 7: 11https://doi.org/10.1186/1471-2202-7-11

Crossref
PubMed
Scopus (544)
Google Scholar

). In contrast, similar to previous reports (

Shi Y.
Kirwan P.
Livesey F.J.

Directed differentiation of human pluripotent stem cells to cerebral cortex neurons and neural networks.

Nat. Protoc. 2012; 7: 1836-1846https://doi.org/10.1038/nprot.2012.116

Crossref
PubMed
Scopus (514)
Google Scholar

), synchronized bursting activity was not observed in cortical cells from an hiPSC background differentiated using DSI until DIV 73 (A and 3F). HiPSCs differentiated using NGN2 direct reprogramming showed activity much earlier, typically between days 14 and 24 (A and 3G). Electrophysiological maturation was monitored with daily activity scans. While max firing rate typically increased and remained relatively stable over time for all cell types during the testing period (B), changes were observed in both the mean firing rate (C) and variance in firing rate (D) over the days of testing; in particular, hiPSCs differentiated using the NGN2 direct reprogramming method showed a considerable increase in mean firing rate and the variance in firing over days of testing.

Figure 3Cortical cells display spontaneous electrophysiological activity

Shaded error = 95% confidence intervals.

(A) Firing rate for E15.5 primary rodent cortical cells, hiPSC cells differentiated to cortical neurons via DSI, and hiPSC cells differentiated via NGN2 direct differentiation. Note different time points for each cell type. Scale bar displays firing frequency (Hz) from 0.0 to 1.0.

(B) Max firing was consistently different between cortical cells from a primary source and cortical cells differentiated from hiPSCs.

(C and D) Mean activity between hiPSCs differentiated using DSI and primary cortical cultures was generally similar, while hiPSCs differentiated using the NGN2 method continued to increase. This is reflected in (D), where the former two cell types displayed minimal changes in the variance in firing within a culture, while the latter increased variance over time.

(E, F, and G) Showcases raster plots over 50 s, where each dot is a neuron firing an action potential colored to help distinguish channel firing and stars indicate time points with observed bursting activity. Note the differences between mid-stage cortical cells from a DIV14 primary rodent culture (E) compared with more mature DIV73 human cortical cells (F) differentiated from iPSCs using the DSI and NGN2 direct differentiated neurons (G) approach described in text, in terms of synchronized activity and stable firing patterns. While all display synchronized activity, there is a difference in the overall levels of activity represented in (B–D).

View Large Image
Download Hi-res image
Download (PPT)

Building a modular, real-time platform to harness neuronal computation

The DishBrain system was developed to leverage neuronal computation and interact with neurons embodied in a simulated environment (; A; Video S2). The DishBrain environment is a low-latency, real-time system that interacts with the vendor MaxOne software, allowing it to be used in ways that extend its original functions (B). This system can record electrical activity in a neuronal culture and provide “sensory” (non-invasive) electrical stimulation comparably to the generation of action potentials by activity in the neuronal network (

Ruaro M.E.
Bonifazi P.
Torre V.

Toward the neurocomputer: image Processing and pattern recognition with neuronal cultures.

IEEE Trans. Biomed. Eng. 2005; 52: 371-383https://doi.org/10.1109/TBME.2004.842975

Crossref
PubMed
Scopus (100)
Google Scholar

). Using the coding schemes described in , external electrical stimulations convey a range of information. For our purposes, we opted for three distinct information categories: predictable, random, and sensory (, C). DishBrain (Figure S2) was designed to integrate these functions to “read” information from and “write” sensory data to a neural culture in a closed-loop system so neural “action” influences future incoming “sensory” stimulation in real time. The intent was to embody BNNs in a virtual environment and to quantify demonstrable learning effects.

Figure 4Schematics and pilot testing with increasing informational density

(A) Diagrammatic overview of DishBrain setup.

(B) Software components and data flow in the DishBrain closed-loop system. Voltage samples flow from the MEA to the “Pong” environment, and sensory information flows back to the MEA, forming a closed loop. Full caption in Figure S2.

(C) Schematic showing the different phases of stimulation to the culture. In line with this is the corresponding summed activity on the raster plot over 100 seconds. The appearance of random stimulation after a ball missing versus system-wide predictable stimulation upon a successful hit is apparent across all three representations. Corresponding images on the right show the position of the ball on both x and y axis relative to the paddle and back wall in percentage of total distance shown on the same timescale.

(D) Final electrode layout schematic for DishBrain Pong-world gameplay.

(E) ∗ = p < 0.05, ∗∗∗ = p < 0.001; error bars = 95% CI. Shows average rally length over three distinct experiment rounds during design of DishBrain Pong-world where each subsequent experiment provided higher density information on ball position than the previous. MCC tested over 272 sessions, n = 50 biological replicates; HCC tested over 579 sessions, n = 18 biological replicates.

View Large Image
Download Hi-res image
Download (PPT)

eyJraWQiOiI4ZjUxYWNhY2IzYjhiNjNlNzFlYmIzYWFmYTU5NmZmYyIsImFsZyI6IlJTMjU2In0.eyJzdWIiOiI0ZTM5ZTgxODQ3OWNlZjJhNTk4YTk2NDlmOWU5Yjc5MyIsImtpZCI6IjhmNTFhY2FjYjNiOGI2M2U3MWViYjNhYWZhNTk2ZmZjIiwiZXhwIjoxNjcyODE0MTU2fQ.m7jEuL_hLvBxtZwAMG5_BozD8KDdWffmBohvpQnuGvXfvfZ-P8LWWs9qL9lEMERhWpZs0j9xGNyiCUoMfM_v0PNHiPzAQjIHc5F0LRCwvnd9jFySqScJZiJbGXR8TmcKBq3h-M7R_y87ueE4pe2QhQ_GRQpztgRDudMH83im169w9h_rZRuuaFQLsypFcAn63bLO4P79BNEleQIlD2936e959ExIaeX3FEw0pmjdNhA_TyNHXywwlacVDINU_df_jPd-AStDexwuumirtXYFl1p9-3lF5T3f-n9QHGiyturahwk8qQyuYPURDDVEoIvkpziHIM6HfwkM_fkvayc3BQ
Download .mp4 (87.19 MB)
Help with .mp4 files
Video S2. Representative movie of interactive SpikeStream visualizer and overview of system
setup, related to Figure 5
Representative movie of a paddle being controlled by the activity of living neurons to play a simulated game of “Pong” in the SpikeStream interactive visualizer with associated descriptions of the methods and summary of key results. This is also available live in real time from any active culture in the DishBrain system

The initial proof of principle using DishBrain was to simulate the classic arcade game “Pong” by delivering inputs to a predefined sensory area of 8 electrodes (D). Electrodes were arranged in a manner that would allow a coarse, yet topographically consistent, place coding, consistent with in vivo systems (see ) (

Baranes K.
Chejanovsky N.
Alon N.
Sharoni A.
Shefi O.

Topographic cues of nano-scale height direct neuronal growth pattern.

Biotechnol. Bioeng. 2012; 109: 1791-1797https://doi.org/10.1002/bit.24444

Crossref
PubMed
Scopus (74)
Google Scholar

;

Patel G.H.
Kaplan D.M.
Snyder L.H.

Topographic organization in the brain: searching for general principles.

Trends Cogn. Sci. 2014; 18: 351-363https://doi.org/10.1016/j.tics.2014.03.008

Abstract
Full Text
Full Text PDF
PubMed
Scopus (38)
Google Scholar

;

Shlens J.
Field G.D.
Gauthier J.L.
Grivich M.I.
Petrusca D.
Sher A.
Litke A.M.
Chichilnisky E.J.

The Structure of Multi-Neuron Firing Patterns in Primate Retina.

J. Neurosci. 2006; 26: 8254-8266https://doi.org/10.1523/JNEUROSCI.1282-06.2006

Crossref
PubMed
Scopus (334)
Google Scholar

). The electrophysiological activity of defined motor regions was gathered—in real time—to move a paddle. If this activity did not result in an interception of the ball by the paddle, an unpredictable stimulus was delivered (150mV voltage at 5Hz for 4 seconds; see ), after which time the ball stimulation would recommence on a random vector. In contrast, if a successful interception occurred, a predictable stimulus was delivered across all electrodes simultaneously at 100Hz for 100ms (briefly interrupting the regular sensory stimulation) before the game continued predictably. Preliminary investigations compared different motor region configurations to verify that motor region setup did not introduce bias (paddle movement that aligned to the ball position) from input stimulation alone (; Figure S3). Experimental cultures of cortical cells showed a higher hit-miss ratio, which we defined as the average rally length, on counterbalanced split-motor configurations (D), where media-only-filled MEAs used as a control group also showed minimal bias. Distinct areas were defined as “motor regions,” where activity in motor region action 1 moved the paddle “up” and activity in motor region action 2 moved the paddle “down.” This fixed layout means that monolayers of cells—with a random distribution that is arbitrary in relation to the “motor” configuration—will need to adopt distinct firing patterns through self-organization (and raises the question to what extent this self-organization will occur).

Increasing the density of sensory information input leads to increased performance

The DishBrain protocol was refined over three pilot studies, each increasing the density of sensory information. Pilot study 1 operated with a 4Hz stimulation that only involved place coding, where the location of the stimulation corresponded to the position of the ball on the y axis. Pilot study 2 investigated different configurations and introduced activity-based weighting to motor regions to account for cell density or activity differences. Pilot study 3 adopted the layout in D and changed to the combined rate (4–40Hz) and place-coding method of data input. This combined rate and place coding has compelling biological similarities conceptually to the rodent barrel cortex, suggesting this encoding is physiologically coherent (

Harrell E.R.
Goldin M.A.
Bathellier B.
Shulz D.E.

An elaborate sweep-stick code in rat barrel cortex.

Sci. Adv. 2020; 6: eabb7189https://doi.org/10.1126/sciadv.abb7189

Crossref
PubMed
Scopus (2)
Google Scholar

;

Ly C.
Middleton J.W.
Doiron B.

Cellular and Circuit Mechanisms Maintain Low Spike Co-Variability and Enhance Population Coding in Somatosensory Cortex.

Front. Comput. Neurosci. 2012; 6https://doi.org/10.3389/fncom.2012.00007

Crossref
PubMed
Scopus (26)
Google Scholar

;

Petersen R.S.
Panzeri S.
Diamond M.E.

Population Coding of Stimulus Location in Rat Somatosensory Cortex.

Neuron. 2001; 32: 503-514https://doi.org/10.1016/S0896-6273(01)00481-0

Abstract
Full Text
Full Text PDF
PubMed
Scopus (193)
Google Scholar

). Gameplay for the final fifteen minutes for each culture type was compared (E and Table S1). Cultures displayed a significant increase in the average rally length between the second and final pilot studies and the first and final pilot studies. Between cultures, human cortical cells (HCCs) had significantly longer average rally lengths than cultures with mice cortical cells (MCCs) (Table S2). Overall, these results support that increasing the amount of sensory information improved performance, even when cell culture features were kept constant.

BNNs learn over time when embodied in a gameplay environment

To test the predictions of the FEP (A ) using selected parameters (), cortical cells (MCCs and HCCs) were compared with media-only controls (CTL); rest sessions (RST), where active cultures controlled the paddle but received no sensory information; and in-silico (IS) controls that mimicked all aspects of the gameplay except the paddle were driven by random noise over 399 test sessions (80-CTL [n = 6 MEA], 42-RST [n = 20 cultures], 38-IS [n = 3 seeds], 101-MCCs [n = 9 cultures], 138-HCCs [n = 11 cultures]). The average rally length showed a significant interaction (B and Table S1) between group and time (first 5 and last 15 min). Only the MCC and HCC cultures showed evidence of learning with significantly increased rally lengths over time. Further, it was found that during gameplay in timepoint 1 (T1), key significant differences were observed (Table S1): the HCC group performed significantly worse than MCC, CTL, and IS groups (Table S2). This suggests that HCCs perform worse than controls when first embodied in an environment, suggesting an initial maladaptive control of the paddle or perhaps an exploratory behavior. Notably, at timepoint 2 (T2), this trend was reversed; the MCC and HCC groups significantly outperformed all control groups along with HCC showing a slight but significant outperformance over the MCC group (Tables S1 and S2). This data demonstrates a significant learning effect in both experimental groups absent in the control groups, along with evidence that the learning capabilities differ between mice and human cells in line with previous results (Video S1).

Figure 5Embodied cortical neurons show significantly improved performance in “Pong” when embodied in a virtual game-world

399 test-sessions were analyzed with biological replicates: 80-CTL (n = 6), 42-RST (n = 20), 38-IS (n = 3), 101-MCCs (n = 9), 138-HCCs (n = 11). Significance bars show within-group differences denoted with ∗. Symbols show between-group differences at the given timepoint: # = versus HCC; % = versus MCC; ˆˆ = versus CTL; @ = versus IS. The number of symbols denotes the p value cutoff, where 1 = p < 0.05, 2 = p < 0.01, 3 = p < 0.001, and 4 = p < 0.0001. Boxplots show interquartile range, with bars demonstrating 1.5× interquartile range, the line marks the median, and ▲ marks the mean.

(A) Schematic of how neurons may engage in the game-world under active inference denoting a gradient flow on variational free energy, expressed in terms of neural activity minimizing prediction errors. ε is prediction error, ξ represents a precision-weighted prediction error. Precision can be regarded as a Kalman gain in Kalman filtering; ‘a’ corresponds to action.

(B–D) Experimental groups according to time point 1 (T1; 0–5 min) and time point 2 (T2; 6–20 min).

(B) Average performance between groups over time, where only experimental (MCC: t = 6.15, p = 5.27−08 and HCC: t = 10.44, p = 3.92−19) showed significant improvement and higher average rally length against all control groups at T2.

(C) Average number of aces between groups and over time, only MCC (t = 2.67, p = 0.008) and HCC (t = 5.95, p = 2.13−08) differed significantly over time. The RST group had significantly more aces compared with the CTL, IS, MCC, and HCC groups at T1 and compared with the CTL, MCC, and HCC at T2. Only MCCs and HCCs showed significant decreases in the number of aces over time, indicating learning. At T2 they also showed fewer aces compared with the IS group, but only the HCC group was significantly less than CTL.

(D) Average number of long rallies (>3) performed in a session. At T1, the HCC group had significantly fewer long rallies compared with all control groups (CTL, IS, and RST). However, both the MCC (t = 5.55, p = 2.36−07) and HCC (t = 10.38, p = 5.27−19) groups showed significantly more long rallies over time. By T2, the HCC group displayed significantly more long rallies compared with the IS group. The HCC group also displayed significantly more long rallies compared with all CTL, IS, and RST control groups.

(E) The average distance that the paddle moved during a session was found to have no obvious relationship with average rally length as the IS control groups showed a higher movement than the experimental groups, while CTL and RST were lower. As such, the observed learning effects are not likely due to stimulation, leading to increased activity of paddle movement.

(F) Distribution of frequency of mean summed hits per minute among groups shows obvious differences; scale bar shows the probability the number of hits in the given minute under that condition.

View Large Image
Download Hi-res image
Download (PPT)

eyJraWQiOiI4ZjUxYWNhY2IzYjhiNjNlNzFlYmIzYWFmYTU5NmZmYyIsImFsZyI6IlJTMjU2In0.eyJzdWIiOiIyMWNmYTY5NmY0MjMwZGNkMmUxZDNiMjJiM2IxMGFmYiIsImtpZCI6IjhmNTFhY2FjYjNiOGI2M2U3MWViYjNhYWZhNTk2ZmZjIiwiZXhwIjoxNjcyODE0MTU2fQ.ql3_qROEF38HYG7fim8pa9hYCUjm-tNXApuVtAOJrRonAVUdLMV9VhMdyvIarbbOuB3nr_Tc9fT8jXL8R_B13serFJI-lfbpYD-p_v9JSHGKOURixadekGqvwvj-q4hoXMvENfGDUI7xiqqyQULNjRbzI8I_t1_OlaUcZEFt9wM913U5npbSIjJkaPNG4GgpDCRGBv93ErO0fbKpXa8s5ves37jGuF9VNQ3BKJcMXbThzDYOnunFHegajzu5ZxQksHHLMfPRBD-Txw8Uo4EvChhIfVjQ5GhqgyjblK9xTBn-Lw3KYXB22PomuPOeqAfppdOgWCMDVmpNJHfcmcN6lw
Download .mp4 (0.18 MB)
Help with .mp4 files
Video S1. Representative movie of DishBrain system in
action, related to Figure 1
Representative video of a paddle being controlled by the activity of living neurons to play a simulated game of “Pong.” It is of particular interest to note how frequently after a successful hit the paddle leads where the ball will eventually end up on the return, even before the ball hits the backwall

Learning effects in BNNs are observed across additional measures

Other key gameplay characteristics, such as the number of times the paddle failed to intercept the ball without a single hit defined as “aces,” and the number of gameplays with greater than 3 consecutive hits defined as “long rallies,” were calculated. As with average rally length, significant interactions between groups and time were found for aces and long rallies (Table S1). Only the MCC and HCC groups showed significantly fewer aces in T2 compared with T1 (C and Table S2). Likewise, only the MCC and HCC groups showed significantly more long rallies in T2 compared with the first (D and Table S2). Collectively, the data shows that both experimental cultures (HCCs and MCCs) improved performance by reducing how often they missed the initial serve and achieving more consecutive hits or longer rallies.

Differences between groups at T1 were found both for aces and long rallies (Table S1). The RST condition displayed significantly more aces than the CTL and MCC groups (Table S2), suggesting a degree of sporadic behavior that the cells exhibit when initially introduced to the rest period from gameplay that results in this behavior. When the number of long rallies at T1 was investigated, it was found that only HCCs had significantly fewer long rallies (Table S2). This finding complements the reduced average rally lengths discussed above. Significant differences between groups at T2 were also found for aces and long rallies (C and 5D and Table S1). Notably, the HCC group showed significantly fewer aces than CTL, RST, and IS groups (Table S1). The MCC group also showed significantly fewer aces than RST and IS groups, but not the CTL group (Table S2). In contrast, for long rallies, the MCC group showed significantly more than the CTL, RST, and IS groups (Table S2), yet the HCC group only showed significantly more long rallies compared with the IS group, but not RST or CTL (Table S2).

No learning effect was found in electrically inactive non-neural cells (HEK293T cells) and media-only controls (Figures S4A–S4C). Further, a significant negative correlation between percentage of aces and percentage of long rallies of both MCCs and HCCs was found, suggesting that the performance was not arising from maladaptive behavior such as fixing the paddle to a single corner (Figure S4D). Whether stimulation alone may cause greater movement of the paddle and that this may result in the observed learning effects was also investigated. As E shows, while there were significant differences observed in paddle movement between conditions (Table S1), for the CTL and RST, this resulted in significantly lower movement relative to the other groups, with the RST being the lowest movement of all groups (Table S2). The IS control group showed significantly more paddle movement than all other groups yet displayed no meaningfully different performance metrics to the other control groups (CTL and RST) (Table S2). Additionally, Figure S4E shows no significant correlation between paddle movement and average rally length was observed, supporting that movement alone of the paddle does not explain the observed learning effects. Wholistically, F emphasizes that both MCCs and HCCs showed fewer aces and more long rallies in T2 compared with T1, reiterating the observed learning effect over time. This can also be seen in linear regressions (Figure S4F), where only the MCC and HCC groups showed a statistically significant positive relationship between average rally length and duration of gameplay.

BNNs require feedback for learning

To investigate the importance of the feedback type for learning, cultures, both MCCs and HCCs, were tested under 3 conditions for 3 days, with 3 sessions per day resulting in a total of 486 sessions. Condition 1 (Stimulus; n = 27) mimicked that used above, where predictable and unpredictable stimuli were administered when the cultures behaved desirably or not, respectively. Condition 2 (Silent; n = 17) involved the stimulus feedback being replaced with a matching time period in which all stimulation was withheld, after which the game restarted with the ball beginning in a random direction. Condition 3 (No feedback; n = 15) removed the restart after a miss. When the paddle did not successfully intercept the ball, the ball would bounce and continue without interruption; the stimulus reporting ball position was still provided. The difference between these conditions is illustrated in A . Rest-period activity was also gathered and used to normalize performance per session basis to account for differences in unstimulated activity ().

Figure 6The importance of feedback in learning

486 sessions were analyzed. Significance bars show within-group differences denoted with ∗. Symbols show between-group differences at the given timepoint: # = versus Stimulus; % = versus Silent. The number of symbols denotes the p value cutoff, where 1 = p < 0.05, 2 = p < 0.01, 3 = p < 0.001, and 4 = p < 0.0001. Box plots show interquartile range, with bars demonstrating 1.5× interquartile range, the line marks the median, and ▲ marks the mean. Errors bands = 1 SE.

(A) Schematic showing the stimulation from the 8 sensory electrodes across 40 s of the same gameplay for each of the three conditions. The bar below color codes what phase of stimulation is being delivered, where random stimulation follows a miss and predictable stimulation follows a hit in the Stimulus condition. Note the corresponding absence of any stimulation in the Silent condition and the lack of any change in sensory stimulation in the No-feedback condition.

(B) Displays the probability of a certain number of hits occurring in a group at a specific minute.

(C) Using different feedback schedules, the Stimulus feedback condition showed significant learning (as in A; t = 7.48, p = 1.58−12) and outperformed Silent and No-feedback average rally length. Silent feedback also showed higher performance compared with these groups at T2.

(D) Displays difference seen in (C) across day.

(E) Shows similar differences versus rest performance for aces across conditions, where the Stimulus group showed significantly fewer aces across time (t = 3.21, p = 0.002).

(F) Displays data from (E) across day.

(G and H) Shows that the Stimulus condition showed significant increase (t = 3.21, p = 0.002) across timepoints; however, as in (H), no differences were found across time for long rallies.

View Large Image
Download Hi-res image
Download (PPT)

Stimulus and Silent conditions showed an overall higher average rally length compared with Rest and No-feedback conditions (B). When testing for differences between groups in the percentage increase of average rally length over matched rest controls, a significant interaction was found (C and Table S1). Only the Stimulus condition showed a significant increase in average rally length over time. While no differences were found for T1, a significant main effect of group was found at T2, where the Stimulus condition had a significantly higher average rally length than the Silent and No-feedback conditions (Table S2). Interestingly, the Silent condition also significantly outperformed the No-feedback conditions, although with a smaller effect size (Table S2). Importantly, this demonstrates that information alone is insufficient; feedback is required to form a closed-loop learning system. When followed up at the level of day for T2 (D), no significant differences over time were observed, but the same between-group differences as above were observed. This trend was similar when looking at aces both summed (E) and across days of testing (F). The Stimulus group at T1 showed significantly fewer long rallies compared with the Silent and No-feedback condition, being reversed at T2 with the Stimulus group showing significantly more long rallies compared with the No-feedback condition (G). No difference was found when this was followed up across days (H). Collectively, these results suggest that adaptive behavior seen in BNNs altering electrophysiological activity can be an emergent property of engaging with—and implicitly modelling—the environment.

Dynamics in electrophysiological activity display coherent connectivity

Electrophysiological activity during gameplay was analyzed from cultures subjected to the stimulus condition to determine functional connectivity (

Mohseni Ahooyi T.
Shekarabi M.
Decoppet E.A.
Langford D.
Khalili K.
Gordon J.

Network analysis of hippocampal neurons by microelectrode array in the presence of HIV-1 Tat and cocaine.

J. Cell. Physiol. 2018; 233: 9299-9311https://doi.org/10.1002/jcp.26322

Crossref
PubMed
Scopus (11)
Google Scholar

). The cross correlations of firing in 100ms-time bins revealed significant, strong positive correlations between activity in the sensory region and both motor regions during Rest and Gameplay (A–7D ). However, when these correlations were calculated per bin and averaged, significantly stronger correlations were observed when cultures were in the Gameplay phase than at Rest (E). This higher degree of connectivity would be expected if activity in the sensory region during gameplay was directly related to activity in motor regions through dynamic self-organization at the system-wide level. In line with this, when the quantity of exclusive motor region activity was calculated per second—looking for events where above-noise-level activity occurred in either motor region 1 or motor region 2, yet not both simultaneously—a significant increase in these events was found when cultures were engaged in gameplay versus rest (F). This type of internal modulation is coherent with the observed performance of these cultures; exclusive activity changes among motor regions would be required for adaptive gameplay. Finally, to further support these results, the correlation between the two motor regions was found to vary substantially over time (G). A linear regression of the correlation in 100ms-time bins between motor regions was found to decrease with time significantly until approximately 5 min of gameplay (R2 = 0.013, F(1, 2049) = 27.51, p = 1.72−7, β = −1.18, p < 0.001). After this point, little further change was observed (R2 = 0.00, F(1, 5181) = 2.19, p = 0.139, β = −0.55, p = 0.139), suggesting a degree of homeostasis. These differences do not affect the overall average culture firing that remains stable throughout the gameplay session (H).

Figure 7Electrophysiological activity during Gameplay and Rest

579 sessions (358 Gameplay, 221 Rest) were analyzed with n = 43 biological replicates. Significance bars show within-group differences denoted with ∗. Symbols show between-group differences at the given timepoint: # = versus Gameplay or Stimulus; % = versus Silent. The number of symbols denotes the p-value cutoff, where 1 = p < 0.05, 2 = p <0.01, 3 = p < 0.001, and 4 = p <0.0001. Box plots show interquartile range, with bars demonstrating 1.5× interquartile range, the line marks the median, and ▲marks the mean. Error bands = 1 SE.

(A–D) A significant positive correlation between mean firing and performance was found between motor region 1 and 2 with the Sensory area both during Rest (A and B) and Gameplay (C and D).

(E) The average cross-sensory motor correlation was significantly less during Rest, both for motor region 1 (t = 30.40, p = 6.61−194) and motor region 2 (t = 29.76, p = 2.76−186) than during Gameplay.

(F) The percentage of mutually exclusive activity events per second across motor regions was calculated and found to increase significantly during Gameplay versus Rest (t = 14.64, p = 5.68−48).

(G) The correlation between the two motor regions showed substantial changes over time (blue). Linear regression conducted on the first 5 min of Gameplay (orange) showed a significant negative relationship between variables that was absent in the final 15 min (teal).

(H) Activity over time showed no significant changes while engaged in Gameplay (r = −0.01, p = 0.563), supporting that any observed learning effects over time were not related to merely gross changes in activity levels across the cultures over time.

(I) Functional plasticity was assessed across cultures when engaged in Gameplay versus Rest, with a significant increase in functional plasticity found during gameplay.

(J) Following random stimulation feedback, there was a significant increase in the mean information entropy during Gameplay (t = 4.890, p = 2.024−6), yet the corresponding time during Rest showed no change (t = 0.016, p = 0.987). Mean information entropy was lower at both pre- (t = 9.781, p = 3.882−19) and post- (t = 5.915, p = 1.178−8) feedback during Gameplay than at Rest.

(K) For normalized mean information entropy, the difference relative to feedback period was increased during Gameplay (t = 19.337, p = 3.476−48), yet still no difference was observed during Rest where no feedback was delivered (t = 1.022, p = 0.316). Normalized mean information entropy was lower at pre- (t = 10.192, p = 2.139−20), but not post- (t = 0.671, p = 0.503) feedback, during Gameplay compared with Rest.

(L) Feedback-related changes in normalized mean information entropy were assessed for the investigation of different feedback mechanisms. Increases following random feedback for the Stimulus condition were replicated (t = 9.623, p = 7.887−19); it was also found that the system displayed increased activity-related scores under the Silent condition feedback (t = 21.538, p = 7.019−47). The No-feedback condition showed no change in normalized mean information entropy at matched times after Bonferroni corrections (t = 10.192, p = 0.030). Post-hoc follow-up tests found no differences between Stimulus and Silent conditions during gameplay; both were significantly lower than for the No-feedback condition. After feedback, the Stimulus and Silent conditions were significantly higher than the No-feedback condition, with the Silent condition significantly higher than the Stimulus condition.

View Large Image
Download Hi-res image
Download (PPT)

As electrical stimulation of neural tissue has been shown to modify neuronal activity (

Bakkum D.J.
Chao Z.C.
Potter S.M.

Spatio-temporal electrical stimuli shape behavior of an embodied cortical network in a goal-directed learning task.

J. Neural. Eng. 2008; 5: 310-323https://doi.org/10.1088/1741-2560/5/3/004

Crossref
PubMed
Scopus (80)
Google Scholar

Bakkum D.J.
Chao Z.C.
Potter S.M.

Long-Term Activity-Dependent Plasticity of Action Potential Propagation Delay and Amplitude in Cortical Networks.

PLoS One. 2008; 3: e2088https://doi.org/10.1371/journal.pone.0002088

Crossref
PubMed
Scopus (81)
Google Scholar

;

Chao Z.C.
Bakkum D.J.
Potter S.M.

Shaping Embodied Neural Networks for Adaptive Goal-directed Behavior.

PLoS Comput. Biol. 2008; 4: e1000042https://doi.org/10.1371/journal.pcbi.1000042

Crossref
PubMed
Scopus (50)
Google Scholar

), the functional plasticity of cultures during Gameplay was assessed compared with when at Rest as described in . I suggests that closed-loop training during Gameplay displays significantly increased plasticity compared with baseline plasticity measured at Rest before training, indicating that functional plasticity was upregulated during gameplay (Table S1). To test whether learning reflects a reduction in VFE within BNNs, we used the information entropy of neuronal responses as a proxy for the average surprise (a.k.a. self-information), which is upper-bounded by VFE (see ). We predicted a reduction in information entropy during the learning of gameplay. We further predicted an increase in entropy following unpredictable (random) feedback, reflecting and ensuing state of “surprise” (and, implicitly, high VFE), relative to pre-feedback states. For the studies reported in , the mean information entropy was found to be lower during Gameplay than during Rest, both before and after the unpredictable feedback stimulation (J and Table S1). There was a significant increase in mean information entropy found post-feedback relative to pre-feedback timepoints during Gameplay, but not in the corresponding timepoints during Rest where no feedback occurred. As the change in entropy can depend on the level of sensory activity pre-feedback, we normalized the mean information entropy by the number of spikes. The relationship was conserved (K and Table S1), where a significant increase in normalized mean entropy was observed during Gameplay, but not at the corresponding timepoint during Rest where no stimulation occurred. In short, as predicted theoretically, gameplay reduced information entropy during predictable exchanges with the environment, while unpredictable feedback increased entropy during gameplay.

We repeated this analysis on the follow-up study of different feedback mechanisms reported in . While it is important to note that the internal information entropy of the culture is not necessarily and directly tied to the external (i.e., sensory) information entropy of the stimulus being applied into a culture, it is interesting to see how cultures respond to different feedback protocols. As shown in L, the change during the stimulus condition between the normalized mean information entropy was replicated for the standard Stimulus condition (Table S1). Of interest is the finding that during the Silent condition, the neural cultures had a higher normalized mean information entropy than even the stimulus condition post-feedback. However, the No-feedback condition showed no change relative to the period when feedback would have been applied, with a significantly higher normalized mean information entropy score than either of the other two conditions pre-feedback, yet a significantly lower score post-feedback (Table S2).

Electrophysiological activity is linked with higher average rally length

Exploratory uncorrected Pearson’s correlations were computed for key electrophysiological activity metrics and average rally length. A significant positive correlation was found between average rally length with mean (A ) and max (B) firing. Likewise, the cross-correlations with the sensory region for both motor region 1 (C) and 2 (E) were significantly positively correlated with performance, further suggesting that robust connectivity is linked with better gameplay outcomes. To further investigate whether the topographical distribution of activity correlated with performance, the absolute values of four discrete cosine transform (DCT) coefficients normalized to mean activity were used to summarize spatial modes of spontaneous activity and assess the symmetry of activity (E). DCT 0,1, which measures activity across the horizontal plane (F), and DCT 2,0, which measures activity on the horizontal edge versus the horizontal center (I), were significantly negatively correlated with average rally length. Yet, DCT 0,2, which shows difference between activity on the vertical edges and the vertical center (G), and DCT 1,0 which measures activity across the vertical plane (H), did not significantly correlate. Given configuration layout, it is coherent that gameplay performance is closely linked to deviations in symmetry of electrophysiological activity. To confirm the importance of symmetry, gameplay electrophysiological activity was analyzed for both motor regions, and the normalized deviation away from symmetry was calculated. As deviation away from symmetry resulted in a significant negative correlation with the average rally length, any asymmetry exceeding approximately 1 deviation appeared to completely prevent performance above that observed in controls (J). This suggests a limit to which cultures can self-organize spontaneous activity if cell culture quality is uneven. Finally—in line with the results above—higher activity in the sensory region (K), motor region 1 (L), and motor region 2 (M) during gameplay was also correlated with higher average rally lengths.

Figure 8Relationship between electrophysiological activity and average rally length

302 gameplay sessions were analyzed after filtering outliers (Z score > ±3.29) from rallies with n = 30 biological replicates.

(A) The mean spontaneous activity (Hz) over all electrodes showed a significant positive correlation with average rally length.

(B–D) Similarly, the max spontaneous firing (Hz) also showed a significant positive correlation with average rally length. In line with this, the average cross correlation between the sensory region and both motor region 1 (C) and motor region 2 (D) had a significant positive correlation with average rally length.

(E) The DCT scores of four different basis functions were calculated to quantify asymmetry in spontaneous activity. DCT scores were normalized to mean activity. The scale bar shows the value assigned to activity in the given area, where each DCT basis function quantifies a different type of asymmetry per pixel from −0.010 to 0.010.

(F–H) Displays the significant negative correlation between DCT 0,1 and average rally length, showing that asymmetry on the horizontal axis is related to poorer performance. There was no significant relationship between DCT 0,2 (G), which measured asymmetry on the horizontal extremes compared with the center, or DCT 1,0 (H), which measured asymmetry on the vertical axis.

(I–M) DCT 2,0 function displayed a significant negative correlation with average rally length, suggesting that asymmetry on the vertical edges compared with the middle was linked to poorer gameplay performance. In line with this, (J) displays the calculated deviation from symmetry in activity between motor regions during gameplay and finds a significant negative association, where greater asymmetry was linked to lower average rally lengths. Similarly, during gameplay the activity in the sensory (K), motor region 1 (L), and motor region 2 (M) all showed significant positive correlations with average rally length.

View Large Image
Download Hi-res image
Download (PPT)

Discussion

Here, we present the DishBrain system, a system capable of embodying BNNs from various sources in a virtual environment and measuring their responses to stimuli in real time. The ability of neurons, especially in assemblies, to respond to external stimuli adaptively is well established in vivo as it forms the basis for all animal learning (

Attinger A.
Wang B.
Keller G.B.

Visuomotor Coupling Shapes the Functional Development of Mouse Visual Cortex.

Cell. 2017; 169: 1291-1302.e14https://doi.org/10.1016/j.cell.2017.05.023

Abstract
Full Text
Full Text PDF
PubMed
Scopus (70)
Google Scholar

). However, this work is the first to establish this fundamental behavior in vitro for a goal-directed behavior. We were able to use this silico-biological system to investigate the fundamentals of biological neuronal computation. In brief, we introduce the first SBI device to demonstrate adaptive behavior in real time. The system itself offers opportunities to expand upon previous in silico models of neural behavior, such as where models of hippocampal and entorhinal cells were tested in solving spatial and non-spatial problems (

Whittington J.C.R.
Muller T.H.
Mark S.
Chen G.
Barry C.
Burgess N.
Behrens T.E.J.

The Tolman-Eichenbaum Machine: Unifying Space and Relational Memory through Generalization in the Hippocampal Formation.

Cell. 2020; 183: 1249-1263.e23https://doi.org/10.1016/j.cell.2020.10.024

Abstract
Full Text
Full Text PDF
PubMed
Scopus (85)
Google Scholar

). Minor variations on the DishBrain platform, selected cell types, drug administration, and feedback conditions would enable an in vitro test to garner data on how cells process and compute information that was previously unattainable.

Most significantly, this work presents a substantial technical advancement in creating closed-loop environments for BNNs (

Bakkum D.J.
Chao Z.C.
Potter S.M.

Spatio-temporal electrical stimuli shape behavior of an embodied cortical network in a goal-directed learning task.

J. Neural. Eng. 2008; 5: 310-323https://doi.org/10.1088/1741-2560/5/3/004

Crossref
PubMed
Scopus (80)
Google Scholar

;

Chao Z.C.
Bakkum D.J.
Potter S.M.

Shaping Embodied Neural Networks for Adaptive Goal-directed Behavior.

PLoS Comput. Biol. 2008; 4: e1000042https://doi.org/10.1371/journal.pcbi.1000042

Crossref
PubMed
Scopus (50)
Google Scholar

;

Wagenaar D.A.
Pine J.
Potter S.M.

Effective parameters for stimulation of dissociated cultures using multi-electrode arrays.

J. Neurosci. Methods. 2004; 138: 27-37https://doi.org/10.1016/j.jneumeth.2004.03.005

Crossref
PubMed
Scopus (242)
Google Scholar

). We have emphasized the requirement for embodiment in neural systems for goal-directed learning to occur. This is seen in the relative performance over experiments, where denser information and more diverse feedback impacted performance. Likewise, when no feedback was provided yet information on ball position was available, cultures showed significantly poorer performance and no learning. Of particular interest was the finding that when stimulatory feedback was removed and replaced with silent feedback (i.e., transient removal of all stimuli), cultures were still able to outperform those with no feedback as in the open-loop condition, albeit to a lesser extent. One interpretation is that playing “Pong” generates more predictable outcomes than not playing “Pong” by reducing uncertainty. Note that a “miss” results in unpredictable outcomes because the ball resets and its subsequent motion is unpredictable. In terms of the informational entropy of the stimulus being delivered, while an unpredictable stimulus would have high entropy, the silent condition still entails higher entropy relative to successful play as the ball restarts in a random direction. This is consistent with our results, as the more unpredictable an outcome, the greater the observed learning effect—as the BNN learns to avoid uncertainty.

It is interesting to note, however, that the internal information entropy of BNN activity does not exactly mirror the information entropy of the external stimulation: while the unpredictable stimulus increased internal entropy, so did the Silent condition feedback. However, for a BNN to alter activity in response to feedback, there must be a change to its sensory input observable by the system that can be associated with its previous activity. This is consistent with the absence of learning in the open-loop/No-feedback condition, which by its nature affords no opportunity for learning, and likewise showed higher internal information entropy than the other two feedback conditions. This supports the thesis that stimulation alone is insufficient to drive learning: there must be a motivation for learning behaviors that influence the (external) observable stimulus. When faced with unpredictable sensorium, playing “Pong” successfully acts as a free energy-minimizing solution. Even if the internal information entropy of a system is increased following feedback and has lower external information entropy (e.g., silent feedback), this may not provide the same impetus for learning. These findings accord with the proposed role of a Markov blanket, providing a statistical boundary of the system to separate it into internal and external states (

Kirchhoff M.
Parr T.
Palacios E.
Friston K.
Kiverstein J.

The Markov blankets of life: autonomy, active inference and the free energy principle.

J. R. Soc. Interface. 2018; 15: 20170792https://doi.org/10.1098/rsif.2017.0792

Crossref
PubMed
Scopus (154)
Google Scholar

;

Palacios E.R.
Razi A.
Parr T.
Kirchhoff M.
Friston K.

On Markov blankets and hierarchical self-organization.

J. Theor. Biol. 2020; 486: 110089https://doi.org/10.1016/j.jtbi.2019.110089

Crossref
PubMed
Scopus (38)
Google Scholar

). Yet simply minimizing entropy (i.e., average surprise) may offer an overly simplified account of adaptive behavior: a key aspect of active inference is the selection of actions that minimize the surprise or free energy expected on following that action. While these results are interesting and supportive, they are not conclusive, and future work is required, including exploring BNN behavior with a generative model.

Mechanistically, we sought to demonstrate the utility of the DishBrain by testing base principles that underwrite active sensing via the FEP. The closest previous work examined blind source separation in neural cultures, yet did so in an open-loop context without physiologically plausible training (

Isomura T.
Kotani K.
Jimbo Y.

Cultured Cortical Neurons Can Perform Blind Source Separation According to the Free-Energy Principle.

PLoS Comput. Biol. 2015; 11: e1004643https://doi.org/10.1371/journal.pcbi.1004643

Crossref
PubMed
Scopus (22)
Google Scholar

;

Isomura T.
Friston K.

In vitro neural networks minimise variational free energy.

Sci. Rep. 2018; 8: 16926https://doi.org/10.1038/s41598-018-35221-w

Crossref
PubMed
Scopus (18)
Google Scholar

). We show that supplying unpredictable sensory input following an “undesirable” outcome and providing predictable input following a “desirable” one significantly shapes the behavior of neural cultures in real time. The predictable stimulation could also be read as a process of stabilizing synaptic weights in line with previous research as it has been shown that higher firing rates augment short- and long-term potentiation (

Pariz A.
Esfahani Z.G.
Parsi S.S.
Valizadeh A.
Canals S.
Mirasso C.R.

High frequency neurons determine effective connectivity in neuronal networks.

Neuroimage. 2018; 166: 349-359https://doi.org/10.1016/j.neuroimage.2017.11.014

Crossref
PubMed
Scopus (15)
Google Scholar

;

Zhu G.
Liu Y.
Wang Y.
Bi X.
Baudry M.

Different Patterns of Electrical Activity Lead to Long-term Potentiation by Activating Different Intracellular Pathways.

What refers to tiny spaces between neurons?

The space between the dendrites of one neuron and the axon of another neuron is called the synapse.

Where is the synaptic gap?

The synaptic cleft, also known as the synaptic gap, is the space in between the axon of one neuron and the dendrites of another and is where the electrical signal is translated to a chemical signal that can be perceived by the next neuron.

What is the small gap between neurons quizlet?

Synapse: The synapse is a small gap between neurons that acts as a place where communication occurs between neurons. Receptors: The site where neurotransmitters attach to the proteins on the cell surface attach.

What is the gap between neurons called quizlet?

The tiny gap at this junction is called the synaptic gap or cleft. Chemical messengers that, when released by the sending neuron, travel across the synapse. (the gap between two neurons) and bind to receptor sites on receiving neurons, setting up the next link in the chain of communication within the nervous system.