Define Your Workload.


Workload Classification
Define your workload type so Lattice can provide relevant recommendations. From high-volume chat to batch processing, each workload has different optimization priorities.
- High-Volume Chat: Real-time conversational AI with strict latency requirements
- RAG Applications: Knowledge base retrieval with accuracy SLOs
- Code Generation: IDE integration with low-latency completion
- Agentic Workflows: Multi-step reasoning with tool use
- Batch Processing: Async workloads with throughput focus
- Training Jobs: Fine-tuning, RLHF, DPO configurations
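
As a rough illustration of the classification above, the workload types can be modeled as an enumeration paired with the optimization focus each bullet names. This is only a sketch: `WorkloadType`, `PRIMARY_FOCUS`, and `primary_focus` are hypothetical names chosen for this example and are not taken from Lattice itself.

```python
from enum import Enum


class WorkloadType(Enum):
    """Hypothetical identifiers for the workload types listed above."""
    HIGH_VOLUME_CHAT = "high_volume_chat"
    RAG = "rag"
    CODE_GENERATION = "code_generation"
    AGENTIC = "agentic"
    BATCH = "batch"
    TRAINING = "training"


# Primary optimization focus per workload, paraphrased from the list above.
# The wording of each value is an assumption made for illustration.
PRIMARY_FOCUS = {
    WorkloadType.HIGH_VOLUME_CHAT: "latency",
    WorkloadType.RAG: "accuracy",
    WorkloadType.CODE_GENERATION: "latency",
    WorkloadType.AGENTIC: "multi-step reasoning and tool use",
    WorkloadType.BATCH: "throughput",
    WorkloadType.TRAINING: "training configuration",
}


def primary_focus(workload: WorkloadType) -> str:
    """Return the optimization focus associated with a workload type."""
    return PRIMARY_FOCUS[workload]
```

Keeping the focus in a separate mapping, rather than as the enum values, avoids accidental aliasing when two workload types share the same priority.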

SLO Configuration
Set your service level objectives and Lattice will recommend models and configurations that meet your latency, throughput, and availability requirements.
- P95 Latency: Target 95th-percentile response time (e.g., 1000 ms)
- Throughput: Requests per minute requirements
- Availability: Uptime and reliability targets
- Token Limits: Input/output token constraints
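
To make the idea of SLO-based filtering concrete, here is a minimal sketch of checking candidate configurations against the four objectives listed above. The `SLO` and `Candidate` classes, their field names, and the `meets_slo` logic are assumptions for illustration; how Lattice actually evaluates candidates is not described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SLO:
    """Hypothetical container for the objectives listed above."""
    p95_latency_ms: float        # maximum acceptable P95 latency
    requests_per_minute: float   # minimum sustained throughput
    availability: float          # minimum uptime as a fraction, e.g. 0.999
    input_tokens: int            # largest input the workload must support
    output_tokens: int           # largest output the workload must support


@dataclass(frozen=True)
class Candidate:
    """Hypothetical measured or advertised figures for one configuration."""
    name: str
    p95_latency_ms: float
    requests_per_minute: float
    availability: float
    max_input_tokens: int
    max_output_tokens: int


def meets_slo(candidate: Candidate, slo: SLO) -> bool:
    """Return True only if the candidate satisfies every objective."""
    return (
        candidate.p95_latency_ms <= slo.p95_latency_ms
        and candidate.requests_per_minute >= slo.requests_per_minute
        and candidate.availability >= slo.availability
        and candidate.max_input_tokens >= slo.input_tokens
        and candidate.max_output_tokens >= slo.output_tokens
    )


def filter_by_slo(candidates: list[Candidate], slo: SLO) -> list[Candidate]:
    """Keep the candidates that meet the SLO, preserving input order."""
    return [c for c in candidates if meets_slo(c, slo)]
```

Under this sketch, a candidate with a P95 latency of 1500 ms would be dropped for a 1000 ms target even if it met every other objective.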

Budget & Cost Constraints
Define your monthly budget and cost-per-request targets. Lattice factors pricing into recommendations to find the best value for your requirements.
- Monthly Spend: Budget caps and alerts
- Cost-per-Request: Targets expressed per 1,000 requests
- Cost vs Quality: Tradeoff preferences
- Reserved vs Spot: Instance pricing options
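
The relationship between a per-1K-request target and a monthly budget is simple arithmetic, sketched below. The token-based pricing model, the function names, and the parameters are assumptions for illustration; real pricing depends on the provider and is not specified here.

```python
def cost_per_1k_requests(
    input_tokens: int,
    output_tokens: int,
    input_price_per_1m: float,
    output_price_per_1m: float,
) -> float:
    """Estimate the cost of 1,000 requests under assumed per-token pricing.

    Token counts are averages per request; prices are per 1,000,000 tokens
    in whatever currency the caller uses.
    """
    per_request = (
        input_tokens * input_price_per_1m
        + output_tokens * output_price_per_1m
    ) / 1_000_000
    return per_request * 1_000


def monthly_spend(cost_per_1k: float, requests_per_month: int) -> float:
    """Project monthly spend from a per-1K-request cost."""
    return cost_per_1k * requests_per_month / 1_000


def within_budget(projected_spend: float, monthly_budget: float) -> bool:
    """Return True if the projected spend does not exceed the budget cap."""
    return projected_spend <= monthly_budget
```

For example, calling `monthly_spend(cost_per_1k_requests(1000, 500, 1.0, 2.0), 1_000_000)` projects the monthly spend for one million requests at those assumed token counts and prices, and the result can be passed to `within_budget` along with the cap.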

Compliance & Privacy
Specify compliance requirements like HIPAA, SOC2, and data residency constraints. Lattice filters recommendations to only show compliant options.
- HIPAA/SOC2: Healthcare and enterprise compliance
- Data Residency: Geographic constraints for data
- Audit Requirements: Logging and provenance tracking
- Privacy Controls: PII handling and anonymization
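
Compliance filtering of this kind can be pictured as a set-membership check, as in the sketch below. The `Option` class, its fields, and the certification and region strings are hypothetical; they are not drawn from Lattice, and the sketch covers only certifications and data residency, not audit or PII controls.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Option:
    """Hypothetical deployment option with its attested certifications."""
    name: str
    certifications: frozenset[str]  # e.g. {"HIPAA", "SOC2"}
    regions: frozenset[str]         # regions where data can be kept


def filter_compliant(
    options: list[Option],
    required_certifications: set[str],
    allowed_regions: set[str],
) -> list[Option]:
    """Keep options that hold every required certification and can run in
    at least one allowed region."""
    return [
        option
        for option in options
        if required_certifications <= option.certifications
        and option.regions & allowed_regions
    ]
```

An option that holds only SOC2 would be excluded when both HIPAA and SOC2 are required, regardless of where it is hosted.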

Technical Specifications
Everything you need to know about scenario configuration and workload definition in Lattice.
Workload Categories
- Inference: Chat, RAG, Code, Embedding
- Training: SFT, LoRA, RLHF, DPO
- Comparison: A/B testing scenarios
SLO Parameters
- Latency: P50, P95, P99 targets
- Throughput: Requests/min, tokens/sec
- Availability: Uptime percentages
- Budget: Monthly and per-request limits
Profile Options
- Traffic: Low, Medium, High, Burst
- Risk: Conservative, Balanced, Aggressive
- Compliance: HIPAA, SOC2, GDPR

Learn More About Scenarios
Explore guides and documentation to get the most out of workload configuration.

Define Your Requirements
Get targeted AI recommendations based on your specific use case and constraints.
Get Lattice for $99