Scientific
methodology,
automated.
Formulate falsifiable hypotheses. Design controlled experiments. Test against 28 real databases. Grade the evidence. 91 computational tools across 4 scientific domains. Grounded in Popper, Fisher, and GRADE.
Open source (AGPL-3.0) · Self-hostable · COSS
How It Works
Six phases. Each grounded in established science.
Every investigation follows a structured scientific protocol. Ehrlich doesn't just search the internet — it formulates hypotheses, designs experiments, tests them against real data, validates with controls, and grades the evidence using peer-reviewed frameworks.
Classification & PICO
Sackett (1996)Decompose your question into Population, Intervention, Comparison, Outcome. Auto-detect domains. Multi-domain questions merge configs automatically.
Literature Survey
GRADE + AMSTAR-2Systematic search with citation chasing. GRADE-adapted evidence grading. AMSTAR-2 quality self-assessment. Haiku compresses and classifies.
Hypothesis Formulation
Popper + Platt + BayesFalsifiable hypotheses with predictions, null predictions, success/failure criteria, scope, type, and prior confidence. You approve before testing starts.
Experiment Execution
Fisher (1935)Experiments with independent/dependent variables, controls, confounders, and analysis plans. Two experiments run in parallel. 91 tools across 4 domains.
Validation & Controls
Zhang (1999) + Y-scramblingNegative controls with known-inactive compounds. Z'-factor assay quality. Permutation significance testing. Scaffold-split vs random-split comparison.
Synthesis
GRADE synthesisCertainty grading (5 downgrading + 3 upgrading domains). Priority tiers. Limitations taxonomy. Knowledge gap analysis. Follow-up recommendations.
Every Hypothesis Carries
Every Experiment Carries
Console
What you see while it runs.
SSE events stream into the console in real time. Hypotheses update live. Candidates rank as experiments complete. Charts render when visualization tools fire. You approve hypotheses before testing begins.
Find antimicrobial compounds effective against MRSA with low resistance risk and favorable ADMET profiles
Compound X MIC < 4 µg/mL against MRSA
Resistance risk via efflux pump mutations is low
ADMET profile permits oral bioavailability
| ID | Score | Docking | ADMET | Lipinski |
|---|---|---|---|---|
| CMP-1247 | 0.94 | -8.7 kcal | Pass | 5/5 |
| CMP-0893 | 0.87 | -7.9 kcal | Pass | 5/5 |
| CMP-2156 | 0.81 | -7.2 kcal | Warn | 5/5 |
| CMP-0412 | 0.74 | -6.8 kcal | Pass | 4/5 |
Choose your team, match the task.
Every investigation assembles a team of three specialized models. Pick the tier that fits your question -- from fast exploration to maximum reasoning power.
Scientific Domains
Four domains. Domain-agnostic engine.
Each domain brings its own tools, scoring definitions, and prompt examples. The orchestrator, methodology, and persistence work identically across all. Multi-domain questions are auto-detected and merged.
MOLECULAR SCIENCE
22 TOOLSDrug discovery, antimicrobial resistance, environmental toxicology, agricultural biocontrol.
Find drug candidates effective against antibiotic-resistant KlebsiellaTRAINING SCIENCE
11 TOOLSExercise physiology, protocol optimization, injury risk assessment, clinical trial evidence.
Compare periodized vs non-periodized resistance training in trained athletesNUTRITION SCIENCE
10 TOOLSSupplement evidence, nutrient adequacy, drug interactions, inflammatory scoring, safety monitoring.
Assess safety and efficacy of vitamin D3 + K2 supplementation at high dosesIMPACT EVALUATION
9 TOOLSCausal analysis of social programs: education, health, employment, housing, sports. Four causal methods (DiD, PSM, RDD, Synthetic Control), 13 data sources across US and Mexico.
What is the causal effect of conditional cash transfers on school enrollment in Latin America?Multi-Domain Investigations
Ask a question that spans multiple domains and Ehrlich detects it automatically. DomainRegistry.detect() returns all matching domains. merge_domain_configs() creates a synthetic config with the union of tool tags, concatenated scoring definitions, and joined prompt examples. The researcher sees tools from all relevant domains.
Add Your Domain
Register a DomainConfig with tool tags, data sources, scoring definitions, and prompt examples. The engine handles orchestration, persistence, visualization, and reporting. Connect external tools via MCP servers — community-built domains plug in without modifying the core engine.
Visualizations
The system picks the right visualization.
The orchestrator intercepts tool results and renders the matching visualization automatically. 3D molecular viewers for docking results. Statistical plots for meta-analysis. Anatomy diagrams for training. Node graphs for hypothesis tracking. No configuration needed.
3D Molecular Viewers
3Dmol.js WebGL- •Live Lab Viewer — SSE-driven scene: protein targets load, ligands dock, candidates color by score
- •3D Conformer Viewer — MMFF94-optimized 3D structures with interactive rotate/zoom
- •Docking Viewer — Protein + ligand overlay showing binding pocket and interactions
Statistical Charts
Recharts + Visx- •Forest Plot — Meta-analysis effect sizes with confidence intervals
- •Funnel Plot — Publication bias assessment across studies
- •Dose-Response Curve — Dose-response with confidence band (Visx)
- •Evidence Matrix — Hypothesis-by-evidence heatmap (Visx)
Domain-Specific Charts
Recharts + Custom SVG- •Binding Scatter — Compound binding affinities across targets
- •ADMET Radar — Drug-likeness property profiles (6 axes)
- •Training Timeline — Training load with ACWR danger zones + brush
- •Performance Chart — Banister fitness-fatigue model (CTL/ATL/TSB)
- •Muscle Heatmap — Anatomical front/back body diagram with activation intensity
- •Nutrient Comparison — Grouped bar chart comparing foods
- •Nutrient Adequacy — Horizontal bars showing % RDA with MAR score
- •Therapeutic Window — EAR/RDA/UL safety zones per nutrient
- •Program Dashboard — Multi-indicator KPI view with target tracking
- •Geographic Comparison — Region bar chart with benchmark line
- •Parallel Trends — DiD treatment vs control over time
Investigation UI
React Flow + Custom- •Investigation Diagram — Hypothesis/experiment/finding node graph with status colors and revision edges
- •Hypothesis Board — Kanban grid with expandable confidence bars and approval cards
- •Candidate Table — Thumbnail grid with 2D SVG + expandable 3D viewer + Lipinski badge
- •Candidate Comparison — Side-by-side scoring view for 2-4 candidates with best-in-group highlighting
- •Investigation Report — 8-section structured report with full audit trail and markdown export
Add Your Own
When you register a new domain, you can create custom visualization components using any rendering library: Recharts, Visx, D3, custom SVG, WebGL, maps, network graphs. Register them in the VizRegistry by viz_type string. The orchestrator auto-intercepts any tool result containing that type and renders it inline. Suspense boundaries, grid layout, and error fallbacks are handled for you.
Every claim
has a source.
Ehrlich queries trusted global databases in real time. Findings link to ChEMBL compound IDs, PDB structure codes, DOIs, and PubChem CIDs. No hallucinated citations. No invented data points.
Self-Referential Research
Every investigation's findings are indexed in a full-text search database. Future investigations query past findings via search_prior_research. Knowledge compounds over time.
Who It's For
Same product at every level.
All 91 tools, all 28 data sources, and the full 6-phase methodology at every tier. The only variable is the Director model quality.
Student
Free Haiku. 3 investigations/month.Learn scientific methodology by doing it. Every investigation teaches hypothesis design, experimental controls, and evidence evaluation. Same tools the professionals use.
Academic Researcher
Monthly credits. Sonnet for routine, Opus for publications.Run systematic reviews, test hypotheses across domains, build on prior findings through self-referential search. Full audit trail for reproducibility.
Industry / Government
BYOK. Your Anthropic key, our methodology + tools.91 computational tools, 28 data sources, structured reporting. Commercial license for private modifications. Self-host or use the hosted instance with your own Anthropic key.
Why Ehrlich
What makes this different
The AI implements the scientific methodology. It doesn't invent it. Tools execute on real data. Findings link to real sources.
Real Computation
91 tools that compute, not summarize.Ehrlich trains ML models, runs causal inference, executes statistical tests, and validates with controls. Every tool returns structured data from real computation or real APIs -- not summaries.
- •Molecular docking + drug-likeness profiling
- •ML classifiers on any structured data (train, predict, cluster)
- •Causal inference: Difference-in-Differences, Propensity Score Matching, RDD
- •Statistical testing (t-test, Mann-Whitney, Fisher, chi-squared)
- •Nutrient interaction screening + adverse event monitoring
Open Source, Self-Hostable
COSS. Same code, two paths.Self-host with your own API key for free -- no limits, no credits, no account. Or use the hosted instance where credits cover Anthropic API costs. A student in Mexico and a pharma company in Boston get the same 91 tools, the same 28 data sources, the same methodology.
- •Self-host: clone, bring your API key, no limits
- •Hosted: credits cover Anthropic costs (Opus is expensive)
- •Credits: Haiku (1), Sonnet (3), Opus (5)
- •AGPL-3.0: inspect, modify, extend, contribute
- •Commercial license for private modifications
Structured Methodology
Popper, Fisher, GRADE. Not conversation.Every investigation follows a 6-phase protocol with falsifiable hypotheses, controlled experiments, evidence hierarchies, and GRADE certainty grading. Findings link to real source IDs. You approve hypotheses before testing begins.
- •Falsifiable hypotheses with predictions + criteria
- •Controlled experiments with confounders + analysis plans
- •8-tier evidence hierarchy traced to original sources
- •GRADE certainty grading on final synthesis
- •User approval gate before experiment execution
Open Source
Ehrlich is COSS -- Commercial Open-Source Software. The same model used by Supabase, PostHog, Cal.com, and GitLab. The entire codebase is open source under AGPL-3.0. There is no proprietary version.
MATERIALS_SCIENCE = DomainConfig(
name="Materials Science",
tool_tags=frozenset({"materials", "simulation"}),
score_definitions=[
ScoreDefinition(
name="hardness",
label="Vickers Hardness",
unit="HV",
),
],
prompt_examples=[
"Discover alloys with high-temperature stability..."
],
)
registry.register(MATERIALS_SCIENCE)AGPL-3.0 (Free Use)
Students, academics, and individual researchers use Ehrlich freely. Self-host internally without restrictions. If you offer Ehrlich as a network service, modifications must be open-sourced.
Commercial License
Companies that want private modifications purchase an AGPL exemption. Includes commercial support, SLA, and custom domain development. Precedent: MongoDB, Confluent, GitLab, Spree Commerce.
91 Tools, 4 Domains
Molecular, training, nutrition, and impact evaluation. Each domain brings its own tools, scoring, and visualization. Add a DomainConfig and the engine handles the rest.
Roadmap
Three domains today. Any domain tomorrow.
The engine is domain-agnostic. Register a DomainConfig with tools, data sources, and scoring definitions. The orchestrator, methodology, and visualization pipeline work identically across all domains.
Materials Science
Alloy design, polymer properties, crystal structure prediction. ICSD, Materials Project, AFLOW databases.
Genomics
Gene expression analysis, variant interpretation, pathway enrichment. NCBI, Ensembl, UniProt cross-referencing.
Environmental Science
Pollution monitoring, climate data analysis, biodiversity assessment. EPA, NOAA, GBIF integration.
MCP Ecosystem
Connect external MCP servers as tool providers. Community-built domains plug in without code changes to the core engine.
REST API
Programmatic access to investigations. Start, monitor, and retrieve results via API. Webhook notifications on completion.
Multi-Provider
Swap the Director, Researcher, or Summarizer to any LLM provider. OpenAI, Google, open-weight models. Mix providers per role for cost or capability.
Team Collaboration
Shared investigations, commenting, branching hypotheses. Build on each other's findings across your research group.
Public Beta
Hosted instance pricing.
Self-hosting is free with your own API key. The hosted instance uses Pay-as-you-go Credits (Haiku=1, Sonnet=3, Opus=5) to cover Anthropic costs. Alternatively, use Bring Your Own Key (BYOK) for free, unlimited hosted access (subject only to your Anthropic API limits).
Credits
Hosted infrastructure with no setup. Credits cover Anthropic API costs.
- •Haiku investigation = 1 credit
- •Sonnet investigation = 3 credits
- •Opus investigation = 5 credits
- •Full 6-phase methodology
- •Hosted high-performance infrastructure
BYOK
Bring Your Own Key. Use your Anthropic API key directly. Ideal for judges and heavy testing.
- •Your own Anthropic API key
- •No Ehrlich credit limits
- •We cover the compute/hosting cost
- •Full 91 tool access
- •Perfect for hackathon evaluation
Or self-host.
Clone the repo, add your API key, run the server. No account needed. Full AGPL-3.0 access to everything.
$ git clone https://github.com/Sequela02/ehrlich$ cd ehrlich/server && uv sync$ export ANTHROPIC_API_KEY=sk-...$ uv run uvicorn ehrlich.api.app:create_app --factory --port 8000
Run your first investigation.
Free tier. No credit card. 3 Haiku investigations per month. Full methodology. All tools.