Omics-OS Docs

Lobster AI Documentation

Comprehensive documentation for Lobster AI - the AI-powered multi-omics bioinformatics analysis platform. Learn to use, develop, and extend Lobster AI.

License: AGPL-3.0-or-later Documentation: CC BY 4.0 Python 3.11-3.14

Welcome to the comprehensive documentation for Lobster AI - the AI-powered multi-omics bioinformatics analysis platform. This documentation provides everything you need to use, develop, and extend Lobster AI.

๐Ÿ“š Documentation Structure

๐Ÿš€ Getting Started

Start here if you're new to Lobster AI

๐Ÿ‘ค User Guide

Learn how to use Lobster AI for your research

๐Ÿ’ป Developer Guide

Extend and contribute to Lobster AI

๐Ÿ“– API Reference

Complete API documentation

๐Ÿ—๏ธ Architecture & Internals

Deep dive into system design

๐Ÿ”ฌ Advanced Features & Internals

Deep dives into specialized capabilities and system internals (v0.2+)

Agent Enhancements:

Content & Publication Intelligence:

Infrastructure & Performance:

Specialized Features:

Migration & Maintenance:

๐ŸŽฏ Tutorials & Examples

Learn by doing with practical tutorials

๐Ÿ”ง Support & Reference

Help and additional resources

๐ŸŽฏ Quick Navigation by Task

"I want to..."

Get Started Quickly

Analyze My Data

Understand the System

Extend Lobster AI

Solve Problems

Master Advanced Features

๐ŸŒŸ Key Features

๐Ÿค– AI-Powered Analysis

  • Natural language interface for complex bioinformatics
  • 8+ specialized AI agents for different analysis domains
  • Intelligent workflow coordination and parameter optimization

๐Ÿงฌ Scientific Capabilities

  • Single-Cell RNA-seq: QC, clustering, annotation, trajectory analysis
  • Bulk RNA-seq: pyDESeq2 differential expression with complex designs
  • Multi-Omics: Integrated cross-platform analysis

โ˜๏ธ Deployment Flexibility

  • Local Mode: Full privacy with data on your machine
  • Cloud Mode: Scalable computing with managed infrastructure
  • Hybrid: Automatic switching between modes

๐Ÿ“Š Professional Features

  • Publication-ready visualizations
  • W3C-PROV compliant provenance tracking
  • Comprehensive quality control metrics
  • Batch effect detection and correction

๐Ÿ“ˆ Version Highlights

Current Release: v0.2 is the first public release of Lobster AI. See the comprehensive documentation for features and upgrade information.

Current Features (v0.2) โœจ

Content Intelligence & Publications:

  • ๐Ÿงฌ Protein Structure Visualization - PyMOL integration for 3D protein visualization and analysis (Details)
  • ๐Ÿ”Œ ContentAccessService - Unified publication/dataset access with 5 specialized providers (Details)
  • ๐Ÿ“„ Docling PDF Parsing - Structure-aware Methods section extraction with >90% hit rate (Details)
  • ๐Ÿ“Š Table Extraction - Parameter tables from scientific publications
  • ๐Ÿงฎ Formula Preservation - Mathematical formulas in LaTeX format

Data Management:

  • ๐Ÿ“ฅ Download Queue System - Robust multi-step data acquisition with JSONL persistence (Details)
  • โšก Enhanced Two-Tier Caching - 30-50x speedup on repeat content access (0.2-0.5s cached)
  • ๐Ÿ”„ Workspace Restoration - Seamless session continuity (Details)
  • ๐Ÿ“‚ Pattern-based Dataset Loading - Smart memory management
  • ๐Ÿ’พ Session Persistence - Automatic state tracking
  • ๐Ÿ’พ WorkspaceContentService - Type-safe caching for research content (Details)

Analysis & Workflows:

  • ๐Ÿงช Formula-Based Differential Expression - Complex experimental designs with R-style formulas (Details)
  • ๐Ÿค– Enhanced Data Expert Agent - New restoration tools and workflows

Infrastructure:

  • ๐Ÿ—๏ธ Provider Infrastructure - Modular, extensible architecture for content retrieval
  • ๐Ÿ—๏ธ Agent Registry Auto-Discovery - Dynamic agent configuration (Details)
  • โŒจ๏ธ Enhanced CLI - Arrow navigation and command history
  • ๐ŸŽจ Rich Interface - Professional orange branding
  • โšก Performance - Optimized startup and processing

๐Ÿ—‚๏ธ Feature Availability Matrix

Quick reference for feature availability across deployment modes.

Core Features by Deployment Mode

FeatureLocalCloud
Content Intelligence
Docling structure-aware parsingโœ…โœ…
Two-tier publication accessโœ…โœ…
ContentAccessServiceโœ…โœ…
Provider infrastructure (5 providers)โœ…โœ…
Analysis Capabilities
Simple DE (two-group)โœ…โœ…
Formula-based DEโœ…โœ…
Agent-guided formulasโœ…โœ…
Protein visualization (batch)โœ…โœ…
Protein visualization (interactive)โœ…โš ๏ธ
Data Management
Basic workspaceโœ…โœ…
WorkspaceContentServiceโœ…โœ…
Download queue (JSONL)โœ…โœ…
Two-tier cachingโœ…โœ…
Infrastructure
Auto agent discoveryโœ…โœ…
FTP retry logicโœ…โœ…

Legend:

  • โœ… Full support
  • โš ๏ธ Partial support (see notes below)

Note: Interactive PyMOL visualization requires local GUI support. Cloud mode supports batch image generation only.

For detailed feature documentation, see the Migration Guide.

๐Ÿ“ Documentation Standards

This documentation follows these principles:

  • Progressive Disclosure: Start simple, dive deeper as needed
  • Task-Oriented: Organized by what you want to accomplish
  • Example-Rich: Real datasets and practical code examples
  • Cross-Referenced: Links between related topics
  • Maintained: Regular updates with each release

๐Ÿค Contributing to Documentation

Found an issue or want to improve the documentation?

  1. Check our developer overview
  2. Submit a pull request to the docs/wiki directory
  3. Follow our code style guidelines

Documentation for Lobster AI v0.2+ | Last updated: 2025

Made with โค๏ธ by Omics-OS

On this page