Dask arrays

This module provides alternative implementations of array classes defined in the allel.model.ndarray module, using dask.array as the computational engine.

Dask uses blocked algorithms and task scheduling to break up work into smaller pieces, allowing computation over large datasets. It also uses lazy evaluation, meaning that multiple operations can be chained together into a task graph, reducing total memory requirements for intermediate results, and only the tasks required to generate the requested part of the final data set will be executed.

This module is experimental, if you find a bug please raise an issue on GitHub.

This module requires dask >= 0.11.1.

GenotypeDaskArray

class allel.GenotypeDaskArray(data, chunks=None, name=None, lock=False)[source]

HaplotypeDaskArray

class allel.HaplotypeDaskArray(data, chunks=None, name=None, lock=False)[source]

AlleleCountsDaskArray

class allel.AlleleCountsDaskArray(data, chunks=None, name=None, lock=False)[source]