GitPedia

Sgkit

Scalable genetics toolkit

From sgkit-dev·Updated May 20, 2026·View on GitHub·

Sgkit is a Python package that provides a variety of analytical genetics methods through the use of general-purpose frameworks such as [Xarray](http://xarray.pydata.org/en/stable/), [Pandas](https://pandas.pydata.org/docs/), [Dask](https://docs.dask.org/en/latest/) and [Zarr](https://zarr.readthedocs.io/en/stable/). The project is written primarily in Python, distributed under the Apache License 2.0 license, first published in 2020. Key topics include: genetics, gwas, popgen, pydata, statgen.

Latest release: 0.10.0
April 7, 2025View Changelog →

sgkit: Scalable genetics toolkit in Python

Build
Windows
Docs
Upstream
Benchmarks
Powered by NumFOCUS

Sgkit is a Python package that provides a variety of analytical genetics methods through the use of
general-purpose frameworks such as Xarray, Pandas,
Dask and Zarr.

For more information on using sgkit, see the documentation.

The sgkit project uses a custom governance model
and is fiscally sponsored by NumFOCUS. Consider making
a tax-deductible donation to help the project
pay for developer time, professional services, travel, workshops, and a variety of other needs.

<div align="center"> <a href="https://numfocus.org/donate-to-sgkit"> <img height="60px" src="https://raw.githubusercontent.com/numfocus/templates/master/images/numfocus-logo.png" align="center"> </a> </div> <br>

Contributors

Showing top 12 contributors by commit count.

View all contributors on GitHub →

This article is auto-generated from sgkit-dev/sgkit via the GitHub API.Last fetched: 6/25/2026