Skip to the content.

!!! OLD VERSION - MOVED TO GITLAB !!!

Integrating public and private barcode data

This project aims to provide tooling for managing ARISE DNA barcode metadata to achieve the following:

✅ Ingest Dutch species registry (NSR) taxonomy as DarwinCore data.

✅ Ingest NSR synonyms from tab-separated tables

✅ Harvest remote data and metadata accessible through BOLD APIs

✅ Integrate data (FASTA/BLAST DB) and metadata (SQLite) to enrich sequence records

⌛ Report and visualize gap analysis of database contents for targeted sequencing (JuPyter)

⌛ Ingest on-disk, file-based data (FASTA) and metadata (CSV/TSV) managed using Geneious and Klasse.

⌛ Filter candidate barcodes by flexible criteria, e.g. marker, provider, geographic origin

⏳ Navigate database contents in tabular form as web view and REST API (Django)

How to use

Consult the doc folder for more info.

Authors

Bastiaan Anker, Pierre-Etienne Cholley, Naomi van Es, Rutger Vos.

(Feel free to add yourself here in any substantial pull requests.)

License

This source code is made available under the MIT License.