PyNCBItk#

Cython bindings and Python interface to the NCBI C++ Toolkit.

Overview#

The NCBI C++ Toolkit is a framework of C++ libraries for working with biological sequence data, developed at the National Center for Biotechnology Information. It features a flexible object model for representing sequences of various origins, including composite or virtual sequences; a resource manager to easily manipulate heterogeneous data sources; and a comprehensive API to the various BLAST algorithms developed at the NCBI.

PyNCBItk is a Python library that provides bindings to the NCBI C++ Toolkit data model and BLAST+ interface using Cython:

Batteries-included

Just add pyncbitk as a pip or conda dependency; there is no need for the BLAST+ binaries or any other external dependency.

Flexible

Load a BioSeq from a FASTA format file or create it programmatically through the Python API.

Practical

Retrieve results as they become available as dedicated SearchResults objects.

Setup#

PyNCBItk is available for all modern Python versions (3.7+).

Run pip install pyncbitk in a shell to download the latest release from PyPI, or have a look at the Installation page to find other ways to install PyNCBItk.
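After installing, a quick sanity check can confirm that the interpreter meets the 3.7+ requirement and that the package can be found. The following is a minimal sketch using only the Python standard library; the helper names `python_supported` and `is_installed` are illustrative and not part of PyNCBItk.

```python
import importlib.util
import sys


def python_supported(minimum=(3, 7)):
    """Return True if the running interpreter is at least `minimum`."""
    return sys.version_info[:2] >= minimum


def is_installed(name):
    """Return True if a top-level module called `name` can be located.

    Only the import machinery is queried; the module itself is not imported.
    """
    return importlib.util.find_spec(name) is not None


if __name__ == "__main__":
    print("Python 3.7+:", python_supported())
    print("pyncbitk found:", is_installed("pyncbitk"))
```

If the second line reports `False`, the package was probably installed into a different environment than the one running the script.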

License#

This library is provided under the MIT License. The NCBI C++ Toolkit is a “United States Government Work” and therefore lies in the public domain. Some restrictions apply; see the upstream license files.

This project is in no way affiliated with, sponsored by, or otherwise endorsed by the NCBI or any associated entity. It was developed by Martin Larralde during his PhD project at the Leiden University Medical Center in the Zeller team.