Neuroscience has witnessed a surge in data generation due to advancements in experimental techniques like electrophysiology, imaging, and genomics. To gain deeper insights into the brain's structure and function in health and disease, it has become essential to conduct large-scale data analyses.
Analyzing large datasets in neuroscience offers various applications, such as uncovering patterns in neuronal activity, building theoretical models, and predicting behavior. This has created an increasing demand for scalable, efficient, and robust data analysis and machine-learning methods that can handle the vast volume of data generated. By collaborating with domain experts, this research initiative seeks to push the frontiers of large-scale data analysis in neuroscience and foster innovative discussions to meet the field's emerging needs.
The primary aim of this Research Topic is to showcase recent progress in data-driven approaches for studying the brain. It focuses on tackling challenges in managing, processing, and interpreting large-scale neuroscience data while identifying future research opportunities. This Research Topic will delve into state-of-the-art tools and methods for analyzing, integrating, and interpreting extensive neuroscience datasets.
Excluding the retraction, there are five papers published in this Research Topic. Hsu et al. consider the problem of warping and registering brain images to a standard template, which can introduce spatial errors and reduce accuracy. They develop LYNSU (Locating by YOLO and Segmenting by U-Net), an automated method for segmenting neuropils in fluorescence images from the FlyCircuit database, eliminating the need for warping and facilitating high-throughput anatomical analysis and connectomics in the Drosophila brain. They demonstrate performance comparable to manual annotations, with a 3D Intersection-over-Union (IoU) of 0.869, and segments a neuropil in about 7 seconds.
Miranda considers task-based fMRI studies and develops a fast Bayesian function-on-scalar model to estimate population-level activation maps for the working memory task. The proposed approach uses a canonical polyadic (CP) tensor decomposition to extract shared and subject-specific features from individual coefficient maps. The subject-specific features are modeled as functions of covariates within a Bayesian framework that accounts for correlations in the CP-extracted features. The proposed decomposition facilitates fast computation and allows efficient MCMC estimation of population-level activation maps.
Dang et al. consider the problem of decoding and feature selection in high dimensions. They introduce the optimized Forward Variable Selection Decoder (oFVSD) toolbox as a feature selection methodology that combines forward variable selection (FVS) and hyperparameter optimization integrated with 18 machine learning models. They test sex classification and age range regression on 1,113 structural MRI datasets and demonstrate performance improvements over models without FVS. The methodology is available as an open-source Python package.
Bologna et al. consider the construction of data-driven brain models using neural simulation environments and large-scale computing facilities. They developed the EBRAINS Hodgkin-Huxley Neuron Builder (HHNB), a web resource for building single cell neural models via the extraction of activity features from electrophysiological data with estimation based on a genetic algorithm. HHNB then allows simulation of the brain model using the estimated model through an interactive setting.
Kim et al. consider the visualization of gene expression obtained using RNA sequencing across the brain. Molecular patterns emerging from spatial transcriptomic data can be associated with circuitry and function in the neocortex. They propose a web app LaminaRGeneVis for visualizing laminar gene expression across datasets collected using bulk, single-nucleus, and spatial RNA sequencing. Allowing for normalizations across different datasets, the app supports single- and multi-gene analyses, data visualization and statistics for the adult human neocortex.
Author contributionsFN: Writing – original draft, Writing – review & editing. OK: Writing – original draft, Writing – review & editing. FW: Writing – original draft, Writing – review & editing.
Conflict of interestThe authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Generative AI statementThe author(s) declare that Gen AI was used in the creation of this manuscript. To generate some text and suggest revisions to existing text.
Publisher's noteAll claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Keywords: neuroimaging data analysis, fMRI, computational modeling, big data, machine learning
Citation: Nathoo FS, Krigolson OE and Wang F (2024) Editorial: Emerging trends in large-scale data analysis for neuroscience research. Front. Neuroinform. 18:1538787. doi: 10.3389/fninf.2024.1538787
Received: 03 December 2024; Accepted: 10 December 2024;
Published: 20 December 2024.
Edited and reviewed by: Michael Denker, Jülich Research Centre, Germany
Copyright © 2024 Nathoo, Krigolson and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Farouk S. Nathoo, bmF0aG9vJiN4MDAwNDA7dXZpYy5jYQ==
留言 (0)