2026-02-21

A programmatically queryable CELLxGENE LaminDB instance

CZ CELLxGENE hosts one of the largest standardized collections of single-cell RNA-seq datasets. Its Census provides efficient access via TileDB-SOMA, and individual datasets are available as .h5ad files on S3. However, programmatically querying across datasets by arbitrary metadata combinations (cell types, tissues, diseases, assays, collections, donor information) has required writing custom data-wrangling code.