Model | Relative Credit Usage | Typical Use Case |
---|---|---|
GPT-4o Mini | 1× | Lightweight chatbots, simple instructions |
GPT-4o | 16× | Strategic reasoning, agents with memory |
Claude Haiku | 2× | Midweight contextual interactions |
Claude Opus | 15–20× | Deep reasoning and multi-hop workflows |

💡 Heavier models yield higher accuracy, but also consume more tokens. Choose based on your use case.

Example: A long SQL-generating prompt with schema context and a 500-word response can consume 3–5× more credits than a standard chat exchange.

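
The multipliers in the table above can be turned into a rough relative comparison. The following is a minimal sketch, assuming credit usage scales linearly with the listed multiplier; `RELATIVE_CREDIT_USAGE` and `relative_cost_range` are hypothetical names used for illustration and are not part of any Lyzr API.

```python
# Relative credit multipliers copied from the table above, as (low, high) ranges.
RELATIVE_CREDIT_USAGE = {
    "GPT-4o Mini": (1, 1),
    "GPT-4o": (16, 16),
    "Claude Haiku": (2, 2),
    "Claude Opus": (15, 20),
}


def relative_cost_range(model, baseline_units=1.0):
    """Scale a baseline cost (as measured on GPT-4o Mini) by a model's multiplier.

    Assumption: cost grows linearly with the multiplier. Real usage also
    depends on prompt and response size.
    """
    low, high = RELATIVE_CREDIT_USAGE[model]
    return (low * baseline_units, high * baseline_units)


if __name__ == "__main__":
    for model in RELATIVE_CREDIT_USAGE:
        low, high = relative_cost_range(model)
        print(f"{model}: {low:g} to {high:g} relative units")
```

A range is returned rather than a single number because Claude Opus is listed as a range in the table.
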
Feature | Credit Usage | Notes |
---|---|---|
Memory | Free | Stores interaction context for multi-turn dialogue. No credit cost. |
Tools Execution | Fixed Cost | Each invocation of a third-party integration (e.g., Slack, Gmail) uses credits. |
Knowledge Base (Classic) | Indirect | No feature cost, but large documents increase prompt size. |
Semantic Model | Indirect + Inference | Embeds tabular metadata; uses credits when the LLM is invoked. |
Knowledge Graph | Indirect + Processing | Graph traversal + node summarization incurs token load. |
Context Relevance Filtering | Moderate | Adds LLM-based ranking layer over retrieved chunks. |
Groundedness Checks | Fixed per use | Post-response factuality validation using internal validators. |
Responsible AI Modules | Variable | Cost depends on the number of submodules invoked (e.g., hallucination, profanity). |

Scenario | Approximate Credit Cost |
---|---|
Simple agent query using GPT-4o Mini | 1–2 credits |
Tool-based agent querying Gmail + Slack | 10–25 credits |
RAG agent using 5 PDF files with GPT-4 | 20–60 credits |
Text-to-SQL with Semantic Model (with schema + sample rows) | 15–50 credits |
Agent using Claude Opus with Context Relevance + RAI | 80–120 credits |

These values are estimates and depend on prompt structure and content size.

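
For budgeting, the estimated ranges above can be converted into a rough count of runs per credit budget. This is a sketch only: the dictionary keys, the `runs_within_budget` helper, and the 600-credit budget in the usage note are hypothetical, and the ranges are the approximate figures from the table, not guaranteed prices.

```python
# Approximate per-run credit ranges (low, high), taken from the scenario table.
SCENARIO_CREDITS = {
    "simple_gpt4o_mini_query": (1, 2),
    "gmail_slack_tool_agent": (10, 25),
    "rag_5_pdfs_gpt4": (20, 60),
    "text_to_sql_semantic_model": (15, 50),
    "claude_opus_context_relevance_rai": (80, 120),
}


def runs_within_budget(scenario, budget_credits):
    """Return (worst_case_runs, best_case_runs) for a given credit budget.

    Worst case assumes every run costs the high estimate,
    best case assumes every run costs the low estimate.
    """
    low, high = SCENARIO_CREDITS[scenario]
    return (budget_credits // high, budget_credits // low)
```

For instance, `runs_within_budget("rag_5_pdfs_gpt4", 600)` gives `(10, 30)`: between 10 and 30 RAG runs, depending on document and prompt size.
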
Plan Type | Expiry Rules | Overflow Handling |
---|---|---|
Free Plan | Resets every 30 days | No rollover |
Monthly Plan | Resets monthly | No rollover unless top-up applied |
Annual Plan | Renewed annually | Quota is spread evenly or loaded in full |
One-Time Top-Up | Never expires | Used after subscription credits are exhausted |

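
The consumption order implied by the table (subscription credits first, top-up credits only once those are exhausted, and no rollover at reset) can be sketched as follows. `CreditBalance` and its methods are hypothetical and only illustrate the rules as stated above; the sketch assumes unused subscription credits are discarded at each reset while top-up credits are left untouched.

```python
from dataclasses import dataclass


@dataclass
class CreditBalance:
    subscription: int  # plan credits, reset each billing period
    top_up: int        # one-time top-up credits, never expire

    def spend(self, credits: int) -> None:
        """Deduct credits, drawing on subscription credits before top-ups."""
        if credits < 0:
            raise ValueError("credits must be non-negative")
        if credits > self.subscription + self.top_up:
            raise ValueError("insufficient credits")
        from_subscription = min(credits, self.subscription)
        self.subscription -= from_subscription
        self.top_up -= credits - from_subscription

    def reset_period(self, quota: int) -> None:
        """Start a new period: no rollover of leftover subscription credits."""
        self.subscription = quota
```

With a balance of 100 subscription and 50 top-up credits, spending 120 leaves 0 subscription and 30 top-up credits; after a reset to a 100-credit quota, the 30 top-up credits are still available.
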
Component | Role |
---|---|
Credits | Unit of Lyzr compute usage |
Tracked Dimensions | Model, token count, tools, features |
Replenishment | Monthly/annual plans or one-time top-ups |
Monitoring | In progress (dashboard reporting coming soon) |
Optimization | Choose lighter models, minimize token overhead, use tools selectively |