!!! OLD VERSION - MOVED TO GITLAB !!!
Integrating public and private barcode data
This project aims to provide tooling for managing ARISE DNA barcode metadata to achieve the following:
✅ Ingest Dutch Species Register (NSR) taxonomy as Darwin Core data
✅ Ingest NSR synonyms from tab-separated tables
✅ Harvest remote data and metadata accessible through BOLD APIs
✅ Integrate data (FASTA/BLAST DB) and metadata (SQLite) to enrich sequence records
⌛ Report and visualize gap analysis of database contents for targeted sequencing (Jupyter)
⌛ Ingest on-disk, file-based data (FASTA) and metadata (CSV/TSV) managed using Geneious and Klasse
⌛ Filter candidate barcodes by flexible criteria, e.g. marker, provider, geographic origin
⏳ Navigate database contents in tabular form as web view and REST API (Django)
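As a rough illustration of the "integrate" and "filter" goals above, the sketch below joins FASTA records to metadata held in SQLite and optionally keeps a single marker. It is not the project's implementation: the `barcode` table, its columns, the sample rows, and the function names are all hypothetical and chosen only for the example. It uses only the Python standard library.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory SQLite database with a HYPOTHETICAL metadata table."""
    conn = sqlite3.connect(":memory:")
    # Assumed schema for illustration only; not the project's actual layout.
    conn.execute(
        "CREATE TABLE barcode ("
        "record_id TEXT PRIMARY KEY, taxon TEXT, marker TEXT, country TEXT)"
    )
    conn.executemany(
        "INSERT INTO barcode VALUES (?, ?, ?, ?)",
        [
            ("rec1", "Parus major", "COI-5P", "Netherlands"),
            ("rec2", "Quercus robur", "matK", "Germany"),
        ],
    )
    return conn


def parse_fasta(text):
    """Yield (record_id, sequence) pairs from FASTA-formatted text."""
    record_id, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if record_id is not None:
                yield record_id, "".join(chunks)
            # The identifier is the first whitespace-delimited token of the header.
            tokens = line[1:].split()
            record_id, chunks = (tokens[0] if tokens else ""), []
        else:
            chunks.append(line)
    if record_id is not None:
        yield record_id, "".join(chunks)


def enrich(conn, fasta_text, marker=None):
    """Attach metadata to each sequence; optionally keep only one marker."""
    enriched = []
    for record_id, sequence in parse_fasta(fasta_text):
        row = conn.execute(
            "SELECT taxon, marker, country FROM barcode WHERE record_id = ?",
            (record_id,),
        ).fetchone()
        if row is None:
            continue  # sequence has no metadata entry, so it is skipped
        taxon, record_marker, country = row
        if marker is not None and record_marker != marker:
            continue  # filtered out by the marker criterion
        enriched.append(
            {
                "record_id": record_id,
                "taxon": taxon,
                "marker": record_marker,
                "country": country,
                "sequence": sequence,
            }
        )
    return enriched


if __name__ == "__main__":
    demo_fasta = ">rec1 example\nACGT\nTTGA\n>rec2\nGGCC\n>rec3\nAAAA\n"
    for record in enrich(build_demo_db(), demo_fasta, marker="COI-5P"):
        print(record["record_id"], record["taxon"], record["sequence"])
```

Sequences without a metadata row are dropped here for simplicity; a real pipeline might instead keep them and flag the missing metadata.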
How to use
Consult the doc folder for more info.
Authors
Bastiaan Anker, Pierre-Etienne Cholley, Naomi van Es, Rutger Vos.
(Feel free to add yourself here in any substantial pull request.)
License
This source code is made available under the MIT License.