Data Collection Methodology

How the platform literature base was filtered, curated, and split into the datasets that power each Explore module.

3,828
Web of Science results
1,910
Food science articles
334
Curated twin-screw extrusion studies
334 studies
Qualitative dataset
Subset
104 studies / 1,566 rows
Quantitative subset with sample-by-sample data
Preview mode: browse the platform structure here, then sign in to open tools and datasets.
273 bib entries

Literature Trends

Track who is publishing, which topics are rising, and where extrusion research is heading.

Sign in to open literature trends
334 studies

Study Summaries (Qualitative Dataset)

Read structured summaries of peer-reviewed studies to quickly compare methods and findings.

Sign in to open study summaries
104 studies · 1,566 rows

Experimental Data (Quantitative Dataset)

Inspect sample-level metrics, cross-study coverage, and analytics-ready experimental data.

Sign in to explore experimental data
334 studies + 1,566 rows

Ingredients

Compare ingredient properties and formulation effects before committing to pilot runs.

Sign in to dive into ingredients
334 studies + LLM extraction pipeline

Literature Relationships

Explore evidence-backed links between ingredients, process conditions, and outcomes across the literature.

Sign in to open literature relationships

How literature relationships are built

A six-stage extraction pipeline turns full-text literature into grounded, citation-traceable ingredient-outcome relationships.

1
Ingest - study PDFs, slides, and symposium transcripts are loaded, including scanned documents.
2
Segment - text is broken into overlapping windows so claims spanning paragraph boundaries are still captured.
3
Index - each segment is encoded so relevant passages can be retrieved for ingredient and outcome queries.
4
Extract - the model pulls out ingredient, outcome, direction, confidence, and grounded mechanism summaries.
5
Standardise - varied terminology is mapped to consistent labels so findings can be compared across papers.
6
Aggregate - matching ingredient-outcome claims are merged across sources, with agreement and contradictions surfaced.
Directional and evidence-weighted - claims reflect how many independent studies agree, with contradictions made explicit.
Grounded mechanisms - explanations come from the study text itself, not free-form model speculation.
Full citation traceability - each claim can link back to a source, page, DOI, and year.
Example: "Pea protein -> Expansion: increase in 5 studies, 0 contradictions. Mechanism: higher protein content promotes melt viscosity, driving radial puffing."