Perplexity: Introducing Hybrid Agentic Inference & Search as Code (SaC)

2. Juni
1 Min. Lesezeit

Key Takeaways:

Perplexity: Introducing Hybrid Agentic Inference & Search as Code (SaC)

Perplexity announced some great updates such as introducing Hybrid Agentic Inference and Search as Code
Hybrid Agentic Inference enables you to split tasks between a local model running on your machine and frontier models in the cloud. This keeps private data on your device and maximizes token efficiency
- Only let Perplexity access a local folder and run a local model via a subagent, then switching to more powerful, expensive models in the cloud
Search as Code changes how Perplexity approaches Search Retrieval and Query Fanouts: Instead of the LLM calling MCP tools, it spawns python scripts to call search functions tailored to the task
- Traditionally, AI systems have treated search as a monolith: an AI model issues a query, the search engine runs its predefined pipeline, and the model consumes the results as context
- However, the most powerful AI systems will require the ability to steer how that context is retrieved, processed, aggregated, and rendered to the model.
- This new architecture empowers models to reach into the search stack itself rather than merely consume its final outputs.
- we expose the components of the search stack as primitives within an SDK. For any request that needs search, a model assembles these primitives on-demand into a retrieval pipeline tailored to that specific request.
- No usage of a traditional Search API: Instead, we've carefully engineered an Agentic Search SDK that exposes the individual building blocks of search at the most atomic level possible.
Do not waste time or thoughts, just test it

Sources: