Skip to main content

4 posts tagged with "neurosynth"

View All Tags

· 6 min read
Alejandro de la Vega

We’re excited to share the release of a significant new innovation in Neurosynth Compose: AI-Assisted Curation. This new feature aims to simplify and accelerate custom neuroimaging meta-analyses, making it easier than ever to curate a set of studies for inclusion into a quantitative meta-analysis.

Traditionally, reviewing studies for inclusion in a meta-analysis takes hundreds of hours, even with a user-friendly interface. The original Neurosynth platform helped by using text mining to group studies by keywords, but extracting crucial details—like participant demographics or task design—remained a challenge because such information is inconsistently reported.

With Large Language Models (LLMs) and zero-shot learning, we can now automatically extract structured study information directly from full-text articles. This a key step towards precise, automated neuroimaging meta-analysis.

LLMs for Neuroscientific Information Extraction

Traditionally, building AI models for information extraction required large annotated datasets, which limited their use in fields like neuroimaging with little labeled data. Modern LLMs change this by interpreting scientific text without domain-specific training. Using zero-shot learning, they can now extract information from articles even without task-specific examples.

By prompting LLMs carefully, we can extract verifiable details—like sample size, diagnosis, age range, tasks, and modalities—from more than 30,000 studies in the NeuroStore database. These details are displayed during curation, making it easier to compare studies and decide which to include.

Figure 1 Figure 1. High-level overview of Zero Shot Information Extraction using LLM prompting

A New Curation Experience

This information is presented in two key places within the revamped curation user interface: a concise table view, allowing for easy comparison across papers, and more detailed individual study pages where you can delve deeper into the extracted specifics for a single paper.

This supports a faster, PRISMA-compliant screening and eligibility process, all within a web-based interface.

Figure 2 Figure 2. Table view showing AI-extracted information (Task Name, Group names, Diagnosis), across three studies

By clicking on a row in the table view, you can see detailed study-level extracted information:

Figure 3 Figure 3. Detailed study-evel AI-extracted information, showing Participant Demographics.

Validated, Iterative Development

Unlike general-purpose review tools (e.g., Elicit, Perplexity, Google Notebook LM), our approach is grounded in domain expert neuroimaging guidelines. First, we developed expert-driven extraction schemas for neuroimaging. Next, each schema is validated by comparing AI outputs with manual annotations (treated as the gold standard), ensuring accuracy before release.

Figure 4

Figure 4. Iterative annotation and prompt development workflow.

This ongoing effort is open to community feedback. The goal is to continuously refine and check our extraction guidelines for neuroimaging-specific study information that help researchers find and screen studies for inclusion into meta-analysis.

Annotated studies are sourced from PubMed Central using pubget and annotated using labelbuddy— a set of tools recently introduced by our group for literature mining (Dockes et al., 2024).

Annotations are openly accessible in the labelbuddy annotations GitHub repository, and extraction pipelines are also made available for transparency.

Initial Extraction Schemas

At launch, we support two schemas: participant demographics and experimental details., extracted from the full text of articles using GPT-4—

1. Participant Demographics

Extracts demopgrahic details separately for each participant group:

FieldDescription
countTotal participants in the group (exclude dropouts).
diagnosisExact clinical/medical diagnosis, including subtypes and comorbidities.
group_nameGroup type: "healthy" (controls) or "patients" (clinical).
subgroup_nameVerbatim group name, if provided.
male_countNumber of males, if explicitly reported.
female_countNumber of females, if explicitly reported.
age_meanMean age, if stated directly in the text.
age_rangeAge range as stated (e.g., "18-65"); use dash format.
age_minimumLowest age reported or lower bound of range.
age_maximumHighest age reported or upper bound of range.
age_medianMedian age, only if explicitly provided.

Validation on 220 articles showed high accuracy, especially for participant counts (<15% error) and diagnosis classification (>0.8 F1-score using BERTScore, on subset of 100 studies with annotated diagnosis). Qualitative analysis confirmed that LLMs are increasingly adept at capturing specific diagnostic information (e.g., "Autism Spectrum Disorder", "phobic prone", "eating disorders prone") and associating them correctly with relevant demographic data, even if the specific form differed from the manual annotation.

2. Experimental Details

Captures study design and task information:

FieldDescription
ModalityImaging modalities used (e.g., "fMRI-BOLD", "MEG", "PET").
StudyObjectiveBrief summary of the study’s main research question or goal.

For each fMRI task presented within the study, the following was also extracted:

FieldDescription
TaskNameExact task name as stated in the text; if not named, provide brief description.
TaskDescription1–2 sentence summary of instructions, stimuli, measures, and objectives.
DesignDetailsDetailed design: type, timing, structure, presentation, response methods.
ConditionsAll experimental and control conditions mentioned.
TaskMetricsAll measured outcomes: behavioral, neural, and subjective.
ConceptsSpecific mental/cognitive concepts explicitly mentioned.
DomainPrimary cognitive domains engaged, if stated.
RestingStateTrue only if described explicitly as a resting-state scan.
RestingStateMetadataRest-specific details: duration, instructions, eyes open/closed, etc.
TaskDesignTask design type(s): Blocked, EventRelated, Mixed, or Other.
TaskDurationTotal task duration (e.g., "10 minutes" or "600 seconds").

Validation on 104 studies found high accuracy for modality and resting-state fields (94%) and strong performance for task information (1-Levenshtein distance of 0.9), particularly when task names were clearly reported in the original sudies (64% of studies). For studies without a clearly defined task name, qualitative review indicated GPT often provided a coherent and plausible description of the task based on the provided context.

This preliminary validation is just a first step. Stay tuned for a more comprehensive evaluation of AI-extracted neuroimaging features!

Get Started

You can try AI-Assisted Curation now at compose.neurosynth.org.

This is an ongoing project: we’ll keep expanding schemas and refining accuracy. We welcome your feedback and ideas—join the conversation on NeuroStars, our discussion forum.

· 2 min read
Alejandro de la Vega

Hello Neurosynth Users,

2023 was a very exciting year for Neurosynth, having launched our Compose platform to the public and announced it on social media. In the December we’ve saw over 500 new user visits, with 200 users signing up for an account! 🚀

Help us keep this growth going by sharing our announcement with your colleagues. 🧑‍🔬

🌟 What’s New 🌟

We’ve also continued to introduce new features and improve the user experience. Here’s some highlights:

Large-scale association tests

A key feature that set Neurosynth aside were large-scale association maps (previously known as “reverse inference”).

Whereas a typical meta-analysis tells you if activity is consistently reported in a target set of studies, an association test tells you if activation occurs more consistently in this set of studies versus a large and diverse reference sample.

That's important, because this allows you to control for base rate differences between regions. Certain regions, such as the insula or lateral PFC for instance, play a very broad role in cognition, and hence are consistently activated for many different tasks and cognitive states. Using MKDA Chi-Squared, you can test if brain activity in a region (such as the insula) is specifically associated with the studies in your meta-analysis.

Previously association tests were available for the automatically generated maps on neurosynth.org. Now you can perform large-scale association tests for your custom meta-analyses in Neurosynth Compose.

We have created a full primer and tutorial on MKDA Chi-Squared, including an example from a recent meta-analysis on social processing. Check it out!

MKDA Chi-Squared Tutorial 🧑‍🎓

UX Enhancements ✨

Based on your valuable feedback, we've made numerous bug fixes and improvements:

  • Simplified Curation: The review import page has been removed, and summary information is now added directly to the tag step.

  • Searching UI: We've replaced the dropdown with a selection gallery, making it easier to choose your preferred search method, and we now auto-generate search import names. In addition, resolving duplicates is skipped if none are present.

  • Improved Editing Workflow: The editing interface has been improved, streamlining the extraction process.

  • Various UX Improvements and Fixes: We fixed many papercuts, especially in the Extraction phase.

We hope you enjoy these changes.

Email us any feedback, or ask a question on NeuroStars if you have issues.

Cheers,

The Neurosynth Team 🧠

· 2 min read
Alejandro de la Vega

Dear Neurosynth Community,

I'm excited to announce important updates to Neurosynth Compose: A free and open platform for neuroimaging meta-analysis.

First, we have added some easy to follow tutorials to our documentation, making it easy to become familiar with our platform.

The tutorials cover two main uses cases we support: Manual and Automated Meta-analyses. Our platform make gold-standard manual meta-analyses much easier, by leveraging pre-extracted imaging data and streamline user interfaces. Automated meta-analyses are ideal for generating exploratory results rapidly, enabling meta-analysis as part of routine scientific practice.

We've also made many small but important updates to our platform, including significant performance updates and improvements to the user interface. Neurosynth Compose is now more intuitive and easier to use. Give it a try by following our manual meta-analysis tutorial.

We also have some exciting new features in the pipeline that we'll release in early 2024 including:

  • Image-based Meta-Analysis (IBMA). Soon, you will be able to use NeuroVault data as inputs for IBMA-- a more powerful and sensitive alternative to Coordinate Based Meta-Analysis.
  • Advanced data extraction using Large Language Models (GPT). Early protypes to extract detailed information (such as participant demographics) from neuroimaging articles using LLMs have shown promise. We are working on incorporating these workflows into Neurosynth Compose, making it even easier to identify relevant studies for meta-analysis.

We look forward to your feedback!

-Alejandro

· 3 min read
Alejandro de la Vega

Dear Neurosynth Community,

My name is Alejandro, and I am the current project leader of the Neurosynth project.

I am very excited to announce to you that the Neurosynth project lives on, and we are officially announcing the (beta) release of the latest member of the ecosystem: Neurosynth Compose.

Neurosynth Compose enables users to easily perform custom neuroimaging meta-analyses using a web-based platform, with no programming experience required. This project addresses one of the most commonly request features, which is the ability to customize large-scale meta-analyses using you own expert knowledge.

Neurosynth Compose is free to use and helps users:

  • 🔎 Search across over 20,000 studies in the Neurosynth database, or import from external databses such as PubMed.
  • 🗃️ Curate your StudySet using systematic review tools conforming to the PRISMA guidelines.
  • 📝 Extract coordinates and metadata for each study, leveraging thousands of pre-extracted studies to minimize effort.
  • 📊 Analyze by specifying a reproducible NiMARE workflow, and execute it locally or in the cloud.
  • 🔗 Share with the community with complete provenance and reproducibility.

The goal of Neurosynth Compose is to enable researchers to go beyond the finite set of automatically generated meta-analyses from the original platform and overcome limitations from automated coordinate and semantic extraction. The end result is a gold standard meta-analysis, in much less time than a manual workflow, and with much greater reproducible.

Currently, Neurosynth Compose is in beta, and under active development. We welcome feedback to ensure our platform meets the needs of the community. Please leave us feedback using the button on the bottom right corner of the screen!

We are working on several upcoming features that will make the platform even better. Many of these features are already available in our Python meta-analysis library, NiMARE, and we are actively working on the user facing online interfaces.

  • Image-based Meta-Analysis (IBMA). We have developed algorithms in NiMARE for using whole-brain statistical maps as inputs to meta-analysis. This is more powerful and sensitive technique compared to Coordinate-base Meta-Analysis. Soon, you will be able to use NeuroVault data as inputs for your meta-analyses.
  • MKDA Chi-squared / Association test. A hallmark feature of Neurosynth is the ability to relate meta-analytic findings to the rest of the literature, to determine the strength and specificity of an association (this was previously called "reverse inference"). This will soon be possible on your custom meta-analyses.
  • A wide range of improvements to the user experience. We are in the process of re-working parts of the online interface to decrease friction when creating a StudySet, making study utilization, and editing more intuitive.

I would like to thank everyone involved in this highly-collaborative project, but especially James Kent, a postdoctoral fellow, and Nick Lee, a software engineer, who did the lion's share of the work.

We are excited for you to try it and let us know what you think.

-Alejandro