Skip to content

Overview

Sources are the knowledge foundation of Lattice. Add documents, URLs, and repositories to build a searchable knowledge base that grounds AI responses.

Supported Source Types

TypeDescriptionProcessing
pdfPDF documentsText extraction, chunking, embeddings
urlWeb pagesHTML parsing, content extraction
githubGitHub repositoriesREADME, code files, documentation
youtubeYouTube videosTranscript extraction
google_docsGoogle DocumentsAPI-based content retrieval
markdownMarkdown filesDirect parsing
textPlain textDirect chunking
artifactPromoted artifactsConverted from Studio

Auto-Classification

Sources are automatically categorized based on content analysis:

  • requirements - SLAs, PRDs, RFPs, specifications
  • research - Academic papers, blog posts, transcripts
  • vendor - Pricing pages, model cards, API documentation
  • architecture - System designs, diagrams, technical specs
  • benchmarks - Leaderboards, evaluations, comparisons
  • tutorial - Guides, tutorials, learning content

Processing Pipeline

  1. Ingestion: Content fetched/uploaded
  2. Extraction: Text extracted from format
  3. Chunking: Split into semantic chunks (~1000 tokens)
  4. Embedding: Vector embeddings generated (if enabled)
  5. Indexing: Full-text search index updated