PyNCBItk
#
Cython bindings and Python interface to the NCBI C++ Toolkit.
Overview#
The NCBI C++ Toolkit is a framework of C++ libraries to work with biological sequence data developed at the National Center for Biotechnology Information. It features a flexible object model for representing sequences of various origin, including composite or virtual sequences; a resource manager to easily manipulate heterogeneous data sources; and a comprehensive API to the various BLAST algorithms developed at the NBCI.
PyNCBItk is a Python library that provides bindings to the NCBI C++ Toolkit data model and BLAST+ interface using Cython:
Just add pyncbitk as a pip or conda dependency, no need
for the BLAST+ binaries or any external dependency.
Load a BioSeq from a FASTA format file or create
it programmatically through the Python API.
Retrieve results as they become available as dedicated
SearchResults objects.
Setup#
PyNCBItk is available for all modern Python versions (3.7+).
Run pip install pyncbitk in a shell to download the latest release from PyPI,
or have a look at the Installation page to find other ways
to install PyNCBItk.
Library#
License#
This library is provided under the MIT License. The NCBI C++ Toolkit is a “United States Government Work” and therefore lies in the public domain. Some restrictions apply, see the upstream license files.
This project is in no way not affiliated, sponsored, or otherwise endorsed by the NCBI or any associated entity. It was developed by Martin Larralde during his PhD project at the Leiden University Medical Center in the Zeller team.