<?xml version='1.0' encoding='UTF-8'?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
  <id>https://blog.lamin.ai</id>
  <title>Blog</title>
  <updated>2026-03-26T21:10:50.619594+00:00</updated>
  <link href="https://blog.lamin.ai"/>
  <link href="https://blog.lamin.ai/index/atom.xml" rel="self"/>
  <generator uri="https://ablog.readthedocs.org/" version="0.10.25">ABlog</generator>
  <entry>
    <id>https://blog.lamin.ai/hello</id>
    <title>Hello world!</title>
    <updated>2022-05-04T00:00:00+00:00</updated>
    <author>
      <name>Alex Wolf</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;We just launched &lt;a class="reference external" href="https://lamin.ai"&gt;lamin.ai&lt;/a&gt; as a place for sharing prototypes with our beta customers and collaborators.
Over time, we’ll add public releases and use this blog to explain our work.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/hello"/>
    <summary>We just launched lamin.ai as a place for sharing prototypes with our beta customers and collaborators.
Over time, we’ll add public releases and use this blog to explain our work.</summary>
    <published>2022-05-04T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/problems</id>
    <title>Key problems of data-heavy R&amp;D</title>
    <updated>2022-07-31T00:00:00+00:00</updated>
    <author>
      <name>Alex Wolf</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;The complexity of modern R&amp;amp;D data often blocks realizing the scientific progress it promises.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/problems"/>
    <summary>The complexity of modern R&amp;D data often blocks realizing the scientific progress it promises.</summary>
    <published>2022-07-31T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/readfcs</id>
    <title>readfcs: Read FCS files</title>
    <updated>2022-08-27T00:00:00+00:00</updated>
    <author>
      <name>Alex Wolf</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;&lt;a class="reference external" href="https://lamin.ai/docs/readfcs"&gt;readfcs&lt;/a&gt; is a lightweight open-source Python package that loads data and metadata from Flow Cytometry Standard (FCS) files into &lt;code class="docutils literal notranslate"&gt;&lt;span class="pre"&gt;DataFrame&lt;/span&gt;&lt;/code&gt; and &lt;code class="docutils literal notranslate"&gt;&lt;span class="pre"&gt;AnnData&lt;/span&gt;&lt;/code&gt; objects, allowing users to flexibly use downstream analytical tools.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/readfcs"/>
    <summary>readfcs is a lightweight open-source Python package that loads data and metadata from Flow Cytometry Standard (FCS) files into DataFrame and AnnData objects, allowing users to flexibly use downstream analytical tools.</summary>
    <published>2022-08-27T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/nbproject</id>
    <title>nbproject: Manage Jupyter notebooks</title>
    <updated>2022-08-29T00:00:00+00:00</updated>
    <author>
      <name>Alex Wolf</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;&lt;a class="reference external" href="https://lamin.ai/docs/nbproject"&gt;nbproject&lt;/a&gt; is an open-source Python tool to help manage Jupyter notebooks with metadata, dependency, and integrity tracking.
A draft-to-publish workflow creates more reproducible notebooks with context.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/nbproject"/>
    <summary>nbproject is an open-source Python tool to help manage Jupyter notebooks with metadata, dependency, and integrity tracking.
A draft-to-publish workflow creates more reproducible notebooks with context.</summary>
    <published>2022-08-29T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/mapped-collection</id>
    <title>MappedCollection: Weighted random sampling from large collections of scRNA-seq datasets</title>
    <updated>2024-04-03T00:00:00+00:00</updated>
    <author>
      <name>Alex Wolf</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;A few labs and companies now train models on large-scale scRNA-seq count matrices and related data modalities. But unlike for many other data types, there isn’t yet a playbook for data scales that don’t fit into memory.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/mapped-collection"/>
    <summary>A few labs and companies now train models on large-scale scRNA-seq count matrices and related data modalities. But unlike for many other data types, there isn’t yet a playbook for data scales that don’t fit into memory.</summary>
    <published>2024-04-03T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/cellxgene</id>
    <title>A programmatically queryable CELLxGENE LaminDB instance</title>
    <updated>2026-02-21T00:00:00+00:00</updated>
    <author>
      <name>Alex Wolf</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;&lt;a class="reference external" href="https://cellxgene.cziscience.com/"&gt;CZ CELLxGENE&lt;/a&gt; hosts one of the largest standardized collections of single-cell RNA-seq datasets.
Its &lt;a class="reference external" href="https://chanzuckerberg.github.io/cellxgene-census/"&gt;Census&lt;/a&gt; provides efficient access via TileDB-SOMA, and individual datasets are available as &lt;code class="docutils literal notranslate"&gt;&lt;span class="pre"&gt;.h5ad&lt;/span&gt;&lt;/code&gt; files on S3.
However, programmatically querying &lt;em&gt;across&lt;/em&gt; datasets by arbitrary metadata combinations — cell types, tissues, diseases, assays, collections, donor information — has required writing custom data wrangling code.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/cellxgene"/>
    <summary>CZ CELLxGENE hosts one of the largest standardized collections of single-cell RNA-seq datasets.
Its Census provides efficient access via TileDB-SOMA, and individual datasets are available as .h5ad files on S3.
However, programmatically querying across datasets by arbitrary metadata combinations — cell types, tissues, diseases, assays, collections, donor information — has required writing custom data wrangling code.</summary>
    <published>2026-02-21T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/symbolic-memory</id>
    <title>Symbolic memory for biological R&amp;D</title>
    <updated>2026-02-27T00:00:00+00:00</updated>
    <author>
      <name>Alex Wolf</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;What should the shared memory layer for agents and humans look like?
Will it live in embeddings or in records?
A high-level note.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/symbolic-memory"/>
    <summary>What should the shared memory layer for agents and humans look like?
Will it live in embeddings or in records?
A high-level note.</summary>
    <published>2026-02-27T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/vitessce</id>
    <title>Interactive visualization of multimodal and spatial data with Vitessce</title>
    <updated>2026-03-02T00:00:00+00:00</updated>
    <author>
      <name>Sunny Sun</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;The open-source tool &lt;a class="reference external" href="https://vitessce.io"&gt;Vitessce&lt;/a&gt; and Lamin now work together to manage &amp;amp; visualize multimodal and spatial single-cell data.
It’s simple: define a Vitessce config in code, save it as an artifact, and share the interactive visualization along with your datasets on LaminHub.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/vitessce"/>
    <summary>The open-source tool Vitessce and Lamin now work together to manage &amp; visualize multimodal and spatial single-cell data.
It’s simple: define a Vitessce config in code, save it as an artifact, and share the interactive visualization along with your datasets on LaminHub.</summary>
    <published>2026-03-02T00:00:00+00:00</published>
  </entry>
  <entry>
    <id>https://blog.lamin.ai/sparse-measurements</id>
    <title>A data lakehouse for biology's sparse measurements</title>
    <updated>2026-03-04T00:00:00+00:00</updated>
    <author>
      <name>Jesse Johnson</name>
    </author>
    <content type="html">&lt;div class="ablog-post-excerpt docutils container"&gt;
&lt;p&gt;One avenue into the future of biotech is scaled learning from multi-modal datasets.
Given that the union of these datasets can easily span millions of sparse features, they can’t be queried through any established data infrastructure.
Warehouses are too rigid, data lakes can’t be queried, and tabular lakehouses don’t understand the formats.
Biology needs a data lakehouse with support for bio-formats and registries.&lt;/p&gt;
&lt;/div&gt;
</content>
    <link href="https://blog.lamin.ai/sparse-measurements"/>
    <summary>One avenue into the future of biotech is scaled learning from multi-modal datasets.
Given that the union of these datasets can easily span millions of sparse features, they can’t be queried through any established data infrastructure.
Warehouses are too rigid, data lakes can’t be queried, and tabular lakehouses don’t understand the formats.
Biology needs a data lakehouse with support for bio-formats and registries.</summary>
    <published>2026-03-04T00:00:00+00:00</published>
  </entry>
</feed>
