Best pedigree software for research 2026: PED, Phenopackets, APIs, and reproducibility
A 2026 guide for statistical geneticists, bioinformaticians, population geneticists, and research labs — evaluating pedigree tools on the dimensions that matter in research: open formats, programmatic access, reproducibility, and publication-quality output.
Short version. For kinship and identity coefficients, QuickPed is the free reference. For PED-format programmatic pipelines, DrawPed. For GA4GH Phenopackets v2, Stanford's PedigreeTool. For REST API and MCP access plus BayesMendel risk models, Evagene. For cohort research, f-tree. For interactive visualisation, PERSEUS from Oxford. For clinical-scale research alongside a research programme, Progeny Clinical where institutional infrastructure exists. No one tool covers all cases — expect to use two or three.
Research needs are genuinely different from clinical needs. A clinical tool prioritises workflow speed and compliance; a research tool prioritises format neutrality, reproducibility, programmatic access, and citation-ready output. The same pedigree may need to move between both ecosystems.
How we evaluated
- Open formats. PED, GEDCOM 5.5.1, JSON, GA4GH Phenopackets — can data move freely?
- Programmatic access. REST API, CLI, R/Python package, MCP.
- Reproducibility. Version control, deterministic output, documented dependencies.
- Licensing. Open-source (GPL, CC BY-SA, MIT) vs proprietary with research access.
- Computation. Kinship, inbreeding, identity coefficients, BayesMendel, Mendelian probabilities.
- Visualisation. Publication-quality SVG or PDF export, large-pedigree rendering.
- Citation and peer review. Published in a peer-reviewed venue?
The tools, ranked by research fit
1. QuickPed — best free tool for kinship mathematics
QuickPed from Magnus Vigeland at Oslo is the reference free tool for computing kinship, inbreeding, and identity coefficients on pedigrees. Published in BMC Bioinformatics (2022). GPL3 licensed. Built in R Shiny; the underlying pedsuite R package family provides programmatic access for reproducible pipelines.
Where it struggles: deliberately narrow. No clinical features, no disease annotation, no Bayesian cancer risk models, no AI. Its strength is that it does one thing exactly right. For any research question that reduces to "what are the coefficients between these individuals," this is what the field uses. Price: free, open-source. Best for: statistical geneticists, forensic genetics, population genetics.
2. DrawPed — best open-source PED-format tool
DrawPed (Schönberger et al., Nucleic Acids Research 2024) is free, open-source under CC BY-SA 4.0, Perl-based, and uses the PED format. It provides programmatic access and fits into computational genetics pipelines where reproducibility and text-based inputs matter. A genuine open-source research tool with a peer-reviewed publication to cite.
Where it struggles: Perl-based tooling is not the modern default for new labs; UI is minimal; no clinical features. Best used as a component of a larger pipeline rather than a standalone research application. Price: free, open-source. Best for: computational genetics pipelines, reproducible research, labs already using PED-format infrastructure.
3. PedigreeTool (Stanford) — best for GA4GH Phenopackets
PedigreeTool from Stanford supports HPO, MONDO, and GA4GH Phenopackets v2 — the emerging standard for computable phenotype exchange in multi-site rare disease research. Free beta until March 2026; paid tiers launching April 2026. The tool's own site states it is not certified for clinical diagnostics, which is a responsible disclosure for research-tier software.
Where it struggles: pricing model from April 2026 is not yet public, and the lack of clinical certification means it cannot be used as-is for regulated clinical workflows. For research consortia standardising on Phenopackets for cross-site phenotype exchange, this is currently the clearest fit. Price: free beta through March 2026; paid tiers from April 2026. Best for: GA4GH-aligned research consortia, rare disease research networks, HPO-native cohort studies.
4. Evagene — best REST API, MCP, and risk-model integration
Evagene (our product) offers a scoped, rate-limited REST API, HMAC-SHA256-signed webhooks for eight event types, an MCP server exposing 11 pedigree tools to Claude Desktop and Claude Code, and an embeddable viewer (iframe, SVG, JS). For research pipelines that need to generate pedigrees programmatically, attach AI-driven interpretation, or have agent-based AI tools read and mutate pedigree data, this is the most complete integration surface in any pedigree platform today. BRCAPRO, MMRpro, PancPRO, and Mendelian inheritance models run directly from the pedigree. GEDCOM 5.5.1 import and export. SVG export for publication figures. BYOK LLM support (Anthropic Claude, OpenAI GPT).
Where we struggle for pure research: Evagene is not open-source. For researchers who require GPL-licensed tooling for reproducibility and long-term preservation, that is a material constraint — use DrawPed or QuickPed instead. Evagene is also Alpha, so institutional archival requirements may not be met yet. Price: free Alpha via waiting list. Best for: translational research labs, AI-agent research pipelines, studies needing BayesMendel with programmatic access.
5. f-tree — best for cohort studies with questionnaire intake
f-tree, developed with Iwate Medical University and published in BMC Medical Genetics (2017), supports Windows, macOS, iPad, and iPhone with a questionnaire-based auto-pedigree approach. NSGC-standard compliant. Free desktop tier with paid features.
Where it struggles: less internationally visible than the reference tools above. The questionnaire-first design suits cohort recruitment but may be slower for statistical geneticists who work directly from PED. Price: free desktop; paid features. Best for: cohort studies with patient questionnaire intake, Japanese research groups, iPad-based field data capture.
6. PERSEUS — best for interactive visualisation research
PERSEUS (Oxford, Bioinformatics 2024) is an interactive web-based pedigree visualisation tool. It is specifically positioned for research rather than clinical use, and its peer-reviewed publication makes it citable.
Where it struggles: newer than the other tools here, so ecosystem adoption and long-term maintenance are still being demonstrated. Price: free. Best for: research groups developing visualisation methods, teaching, interactive cohort exploration.
7. Progeny Clinical — best clinical platform with research footprint
Progeny Clinical is deployed at 800+ genetics institutions, many with research programmes alongside clinical activity. If your research sits inside an existing Progeny-using institution, extracting clinical pedigree data for research analysis through Progeny's standard export paths may be the most practical route even if the tool itself is not research-primary.
Where it struggles for research: proprietary, commercial, no public open-source licence, no programmatic research API advertised. Pricing on application. Price: on application. Best for: research embedded in institutions already using Progeny Clinical. See Progeny vs Evagene.
Comparison matrix
| Tool | Open-source | PED | Phenopackets | API / MCP | Kinship | Published |
|---|---|---|---|---|---|---|
| QuickPed | GPL3 | ✓ | — | R / pedsuite | ✓ reference | BMC Bioinfo 2022 |
| DrawPed | CC BY-SA | ✓ | — | Perl CLI | — | NAR 2024 |
| PedigreeTool | — | — | ✓ v2 | — | — | Stanford |
| Evagene | — | GEDCOM | via API | REST + MCP | Wright coefficient | — |
| f-tree | — | — | — | — | — | BMC Med Gen 2017 |
| PERSEUS | — | — | — | — | — | Bioinformatics 2024 |
| Progeny Clinical | — | — | — | — | — | commercial |
Which to choose if...
- You compute kinship, inbreeding, or identity coefficients: QuickPed.
- You run PED-format programmatic pipelines: DrawPed.
- You are in a GA4GH Phenopackets v2 research network: Stanford PedigreeTool.
- You need a REST API, MCP server, and BayesMendel integration in one place: Evagene.
- You develop novel pedigree visualisations or interactive tools: PERSEUS.
- You run cohort studies with questionnaire intake: f-tree.
- Your research sits inside a Progeny-using institution: Progeny Clinical.
- You want an AI agent (Claude Desktop, Claude Code) to read and edit pedigrees: Evagene's MCP server is the only public offering.
Frequently asked questions
What is the best pedigree software for research?
QuickPed for kinship, DrawPed for PED pipelines, PedigreeTool for Phenopackets, Evagene for REST API and MCP, f-tree for cohorts, PERSEUS for visualisation.
Which supports GA4GH Phenopackets v2?
Stanford's PedigreeTool explicitly. Evagene via its REST API.
Which has programmatic access?
Evagene (REST API, webhooks, MCP). DrawPed (Perl CLI). QuickPed (R via pedsuite).
Which free tool computes kinship coefficients?
QuickPed. GPL3, published in BMC Bioinformatics 2022.
Is any clinical tool usable for research?
Progeny Clinical at research-active institutions. Evagene for programmatic access. Phenotips in rare disease consortia.
Best for publication-quality figures?
Evagene (SVG export), QuickPed, PERSEUS.
Large cohorts?
Evagene's REST API, DrawPed's PED pipelines, QuickPed for arbitrary-size coefficient computation.