2789 Publications

The neuron as a direct data-driven controller

J. Moore, A. Genkin, Magnus Tournoy, J. Pughe-Sanford, Rob R. de Ruyter van Steveninck, D. Chklovskii

Building upon the efficient coding and predictive information theories, we present a perspective that neurons not only predict but may also actively influence their future inputs through their outputs. We model neurons as feedback controllers of their environments, a role traditionally considered computationally demanding, particularly when the dynamical system characterizing the environment is unknown. By harnessing an advanced data-driven control framework, we illustrate the feasibility of biological neurons functioning as effective feedback controllers. This innovative approach enables us to coherently explain various experimental findings that previously seemed unrelated. Our research has multiple potential implications, from the modeling of neuronal circuits to enabling biologically inspired artificial intelligence systems. In the quest to model neuronal function amid gaps in physiological data, a promising strategy is to develop a normative theory that interprets neuronal physiology as optimizing a computational objective. This study extends current normative models, which primarily optimize prediction, by conceptualizing neurons as optimal feedback controllers. We posit that neurons, especially those beyond early sensory areas, steer their environment toward a specific desired state through their output. This environment comprises both synaptically interlinked neurons and external motor sensory feedback loops, enabling neurons to evaluate the effectiveness of their control via synaptic feedback. To model neurons as biologically feasible controllers which implicitly identify loop dynamics, infer latent states, and optimize control we utilize the contemporary direct data-driven control (DD-DC) framework. Our DD-DC neuron model explains various neurophysiological phenomena: the shift from potentiation to depression in spike-timing-dependent plasticity with its asymmetry, the duration and adaptive nature of feedforward and feedback neuronal filters, the imprecision in spike generation under constant stimulation, and the characteristic operational variability and noise in the brain. Our model presents a significant departure from the traditional, feedforward, instant-response McCulloch–Pitts–Rosenblatt neuron, offering a modern, biologically informed fundamental unit for constructing neural networks.

Show Abstract

AstroCLIP: a cross-modal foundation model for galaxies

Liam Parker , Francois Lanusse, Siavash Golkar, Leopoldo Sarra, Miles Cranmer, A. Bietti, Michael Eickenberg, Geraud Krawezik, Michael McCabe , R. Morel, R. Ohana, B. Régaldo-Saint Blancard, et al.

We present AstroCLIP, a single, versatile model that can embed both galaxy images and spectra into a shared, physically meaningful latent space. These embeddings can then be used – without any model fine-tuning – for a variety of downstream tasks including (1) accurate in-modality and cross-modality semantic similarity search, (2) photometric redshift estimation, (3) galaxy property estimation from both images and spectra, and (4) morphology classification. Our approach to implementing AstroCLIP consists of two parts. First, we embed galaxy images and spectra separately by pre-training separate transformer-based image and spectrum encoders in self-supervised settings. We then align the encoders using a contrastive loss. We apply our method to spectra from the Dark Energy Spectroscopic Instrument and images from its corresponding Legacy Imaging Survey. Overall, we find remarkable performance on all downstream tasks, even relative to supervised baselines. For example, for a task like photometric redshift prediction, we find similar performance to a specifically trained ResNet18, and for additional tasks like physical property estimation (stellar mass, age, metallicity, and specific-star-formation rate), we beat this supervised baseline by 19 per cent in terms of R2. We also compare our results with a state-of-the-art self-supervised single-modal model for galaxy images, and find that our approach outperforms this benchmark by roughly a factor of two on photometric redshift estimation and physical property prediction in terms of R2, while remaining roughly in-line in terms of morphology classification. Ultimately, our approach represents the first cross-modal self-supervised model for galaxies, and the first self-supervised transformer-based architectures for galaxy images and spectra.

Show Abstract

Privileged representational axes in biological and artificial neural networks

Meenakshi Khosla, A. Williams, Josh McDermott, N. Kanwisher

How do neurons code information? Recent work emphasizes properties of population codes, such as their geometry and decodable information, using measures that are blind to the native tunings (or 'axes') of neural responses. But might these representational axes matter, with some privileged systematically over others? To find out, we developed methods to test for alignment of neural tuning across brains and deep convolutional neural networks (DCNNs). Across both vision and audition, both brains and DCNNs consistently favored certain axes for representing the natural world. Moreover, the representational axes of DCNNs trained on natural inputs were aligned to those in perceptual cortices, such that axis-sensitive model-brain similarity metrics better differentiated competing models of biological sensory systems. We further show that coding schemes that privilege certain axes can reduce downstream wiring costs and improve generalization. These results motivate a new framework for understanding neural tuning in biological and artificial networks and its computational benefits.Competing Interest StatementThe authors have declared no competing interest.

Show Abstract

DIPG-15. NOVEL CNS SENSING SYNNOTCH-CAR T CELLS FOR TARGETING DIFFUSE MIDLINE GLIOMA

Senthilnath Lakshmanachetty, Milos Simic, O. Troyanskaya, et al.

Diffuse midline glioma (DMG), including Diffuse intrinsic pontine glioma (DIPG), is an aggressive brain tumor in children with limited treatment options. Recent developments of phase 1 clinical trials have shown early promise for chimeric antigen receptor (CAR) T cells in patients with DMG/DIPG. However, several barriers such as the absence of tumor-specific antigens, restricted trafficking to the tumor site, and poor persistence hinder the full therapeutic potential of CAR T cell therapy in DMG/DIPG.

Show Abstract

Learning sum of diverse features: computational hardness and efficient gradient-based training for ridge combinations

Kazusato Oko, Yujin Song, Taiji Suzuki, D. Wu

We study the computational and sample complexity of learning a target function f∗ : Rd → R with additive structure, that is, f∗(x) = 1 √ M M m=1fm(⟨x,vm⟩), where f1,f2,...,fM : R → R are nonlinear link functions of single-index models (ridge functions) with diverse and near-orthogonal index features vmM m=1, and the number of additive tasks M grows with the dimensionality M ≍dγ forγ ≥ 0. This problem setting is motivated by the classical additive model literature, the recent representation learning theory of two-layer neural network, and large-scale pretraining where the model simultaneously acquires a large number of “skills” that are often localized in distinct parts of the trained network. We prove that a large subset of polynomial f∗ can be efficiently learned by gradient descent training of a two-layer neural network, with a polynomial statistical and computational complexity that depends on the number of tasks M and the information exponent of fm, despite the unknown link function and M growing with the dimensionality. We complement this learnability guarantee with computational hardness result by establishing statistical query (SQ) lower bounds for both the correlational SQ and full SQ algorithms.

Show Abstract

Measuring and modeling the dynamics of mitotic error correction

Gloria Ha, D. Needleman, et al.

Error correction is central to many biological systems and is critical for protein function and cell health. During mitosis, error correction is required for the faithful inheritance of genetic material. When functioning properly, the mitotic spindle segregates an equal number of chromosomes to daughter cells with high fidelity. Over the course of spindle assembly, many initially erroneous attachments between kinetochores and microtubules are fixed through the process of error correction. Despite the importance of chromosome segregation errors in cancer and other diseases, there is a lack of methods to characterize the dynamics of error correction and how it can go wrong. Here, we present an experimental method and analysis framework to quantify chromosome segregation error correction in human tissue culture cells with live cell confocal imaging, timed premature anaphase, and automated counting of kinetochores after cell division. We find that errors decrease exponentially over time during spindle assembly. A coarse-grained model, in which errors are corrected in a chromosome-autonomous manner at a constant rate, can quantitatively explain both the measured error correction dynamics and the distribution of anaphase onset times. We further validated our model using perturbations that destabilized microtubules and changed the initial configuration of chromosomal attachments. Taken together, this work provides a quantitative framework for understanding the dynamics of mitotic error correction.

Show Abstract

Structure and dynamics of motor-driven microtubule bundles

Bezia Lemma, et al.

Connecting the large-scale emergent behaviors of active cytoskeletal materials to the microscopic properties of their constituents is a challenge due to a lack of data on the multiscale dynamics and structure of such systems. We approach this problem by studying the impact of depletion attraction on bundles of microtubules and kinesin-14 molecular motors. For all depletant concentrations, kinesin-14 bundles generate comparable extensile dynamics. However, this invariable mesoscopic behavior masks the transition in the microscopic motion of microtubules. Specifically, with increasing attraction, we observe a transition from bi-directional sliding with extension to pure extension with no sliding. Small-angle X-ray scattering shows that the transition in microtubule dynamics is concurrent with a structural rearrangement of microtubules from an open hexagonal to a compressed rectangular lattice. These results demonstrate that bundles of microtubules and molecular motors can display the same mesoscopic extensile behaviors despite having different internal structures and microscopic dynamics. They provide essential information for developing multiscale models of active matter.

Show Abstract

Contrastive pre-training for sequence based genomics models

Ksenia Sokolova, Kathleen M. Chen, O. Troyanskaya

In recent years deep learning has become one of the central approaches in a number of applications, including many tasks in genomics. However, as models grow in depth and complexity, they either require more data or a strategic initialization technique to improve performance. In this project, we introduce cGen, a novel unsupervised, model-agnostic contrastive pretraining method for sequence-based models. cGen can be used before training to initialize weights, reducing the size of the dataset needed. It works through learning the intrinsic features of the reference genome and makes no assumptions on the underlying structure. We show that the embeddings produced by the unsupervised model are already informative for gene expression prediction and that the sequence features provide a meaningful clustering. We demonstrate that cGen improves model performance in various sequence-based deep learning applications, such as chromatin profiling prediction and gene expression. Our findings suggest that using cGen, particularly in areas constrained by data availability, could improve the performance of deep learning genomic models without the need to modify the model architecture.

Show Abstract
June 12, 2024

Variational bounds and nonlinear stability of an active nematic suspension

We use the entropy method to analyse the nonlinear dynamics and stability of a continuum kinetic model of an active nematic suspension. From the time evolution of the relative entropy, an energy-like quantity in the kinetic model, we derive a variational bound on relative entropy fluctuations that can be expressed in terms of orientational order parameters. From this bound we show isotropic suspensions are nonlinearly stable for sufficiently low activity, and derive upper bounds on spatiotemporal averages in the unstable regime that are consistent with fully nonlinear simulations. This work highlights the self-organising role of activity in particle suspensions, and places limits on how organised such systems can be.

Show Abstract

Disparate nonlinear neural dynamics measured with different techniques in macaque and human V1

J. Zhou, Matt Whitmire, Yuzhi Chen, Eyal Seidemann

Diverse neuro-imaging techniques measure different aspects of neural responses with distinct spatial and temporal resolutions. Relating measured neural responses across different methods has been challenging. Here, we take a step towards overcoming this challenge, by comparing the nonlinearity of neural dynamics measured across methods. We used widefield voltage-sensitive dye imaging (VSDI) to measure neural population responses in macaque V1 to visual stimuli with a wide range of temporal waveforms. We found that stimulus-evoked VSDI responses are surprisingly near-additive in time. These results are qualitatively different from the strong sub-additive dynamics previously measured using fMRI and electrocorticography (ECoG) in human visual cortex with a similar set of stimuli. To test whether this discrepancy is specific to VSDI—a signal dominated by subthreshold neural activity, we repeated our measurements using widefield imaging of a genetically encoded calcium indicator (GcaMP6f)—a signal dominated by spiking activity, and found that GCaMP signals in macaque V1 are also near-additive. Therefore, the discrepancies in the extent of sub-additivity between the macaque and the human measurements are unlikely due to differences between sub- and supra-threshold neural responses. Finally, we use a simple yet flexible delayed normalization model to capture these different dynamics across measurements (with different model parameters). The model can potentially generalize to a broader set of stimuli, which aligns with previous suggestion that dynamic gain-control is a canonical computation contributing to neural processing in the brain.

Show Abstract
  • Previous Page
  • Viewing
  • Next Page
Advancing Research in Basic Science and MathematicsSubscribe to Flatiron Institute announcements and other foundation updates