I can route you to the right public Folium room across services, proof, human control, trust, industries, AI search, and operating-system build paths. This is a guided route finder, not a live AI chat or support desk.
Local private hybrid AI guide
Local, Private, And Hybrid AI Guide
Not every AI workload belongs in the same runtime. Folium helps buyers choose cloud APIs, private endpoints, local models, hybrid routes, RAG, agents, and fallback paths based on privacy, cost, latency, portability, control, and support.
Cloud AI, local AI, private endpoints, and hybrid routing each have a place.
Runtime choice should follow data sensitivity, cost, latency, accuracy, support, and owner review.
Folium can help operate model-agnostic paths across APIs, local models, Ollama, vLLM, SGLang, RAG, agents, databases, and legacy systems.
Runtime placement
Local, private, and hybrid AI choices should be business decisions.
The runtime guide compares cloud APIs, private endpoints, local models, hybrid routes, RAG placement, agent lanes, costs, latency, custody, and fallback.
01Shows Folium can reason beyond one provider.
02Makes privacy, cost, latency, and control tradeoffs visible.
03Protects data custody before architecture locks in.
R
Navigation map
Choose the review route before reading cover to cover.
This packet is meant to support a real decision meeting. Different reviewers should enter through different routes, then come back together around the same controlled next step.
Executive route
Decision first
Start with the cover, visual summary, executive read, controls, first ninety days, and handoff. This route helps leaders decide whether the next move is education, audit, first build, pilot, or operations.
- Outcome
- Risk
- Owner
- Next gate
Operations route
How the work will run
Read the workflow map, procedures, operating roles, metrics, first sprint, and buyer worksheet. This route shows whether staff can actually use, review, and improve the future process.
- Workflow
- Staff
- Support
- Improve
Technical and trust route
Where the boundaries live
Focus on records and work products, controls, risk assumptions, reference work products, source truth, runtime placement, and launch conditions before any private access expands.
- Source
- Access
- Runtime
- Rollback
Buyer session route
Turn reading into a working session
Use the discovery questions, role review route, buyer worksheet, and engagement fit ladder to prepare one process, one owner, one source map, and one next decision.
- Process
- Examples
- Questions
- Decision
Best use: bring one workflow, the people who own it, the systems it touches, the data classes involved, and the decision this packet should help leadership make.
01
Executive read
Local private hybrid AI guide in plain language.
Not every AI workload belongs in the same runtime. Folium helps buyers choose cloud APIs, private endpoints, local models, hybrid routes, RAG, agents, and fallback paths based on privacy, cost, latency, portability, control, and support.
Cloud
Cloud APIs for speed and capability
Cloud models can be excellent when data, policy, cost, and latency fit the use case.
- Capability
- Speed
- Vendor
Private
Private endpoints for control
Private or dedicated routes can reduce exposure and fit stricter operating rules.
- Boundary
- Access
- SLA
Local
Local models for custody
Local AI can keep work closer to the business when privacy, cost, or availability matters.
- Ollama
- vLLM
- SGLang
Hybrid
Hybrid routing for fit
The best architecture may route each task to the runtime that matches its risk and value.
- Router
- Fallback
- Governance
This packet is public-facing. It is written for serious review without exposing private infrastructure, customer data, credentials, live provider wiring, or internal project labels.
02
Workflow map
The operating path should be visible before anyone trusts the outcome.
Folium uses workflow maps to turn broad AI ambition into inspectable work. Each phase names the procedure, the visible output, and the decision gate that prevents excitement from outrunning control.
| Phase | Procedure | Visible output | Decision gate |
|---|---|---|---|
| Classify data | Identify public, internal, confidential, regulated, secret, and blocked data. | Data sensitivity map. | Runtime cannot be chosen before data class. |
| Classify tasks | Separate drafting, retrieval, reasoning, summarization, routing, and action tasks. | Task authority map. | Each job has a role. |
| Choose runtime | Compare cloud, private, local, hybrid, and manual fallback for each task. | Runtime placement matrix. | The route fits the job. |
| Design RAG | Define source ingestion, chunking, metadata, retrieval, freshness, and evaluation. | Knowledge architecture. | Answers stay source-grounded. |
| Design agents | Set tool permissions, human approval, logs, blocked actions, and escalation. | Agent control spec. | Agents cannot exceed authority. |
| Deploy safely | Prepare containers, services, monitoring, API boundaries, cost controls, and rollback. | Deployment guide. | Operations can support it. |
| Evaluate | Test accuracy, retrieval, latency, cost, privacy, failure handling, and fallback. | Evaluation report. | Runtime earns trust. |
| Operate | Monitor usage, update sources, rotate models, review cost, and maintain support. | AI runtime operations loop. | The route stays useful. |
03
Records and work products
The work should leave behind material a buyer can inspect.
A serious engagement should produce more than conversation. Folium packages records, diagrams, checklists, routes, system surfaces, launch gates, and handoff material so the buyer can keep control after the first win.
| Work product | What it contains | How the reviewer uses it |
|---|---|---|
| Runtime placement matrix | Tasks mapped to cloud, private, local, hybrid, or manual fallback. | Prevents one-size-fits-all AI. |
| Data custody map | Where data originates, moves, persists, and gets blocked. | Supports security review. |
| Model route catalog | Candidate providers, local models, endpoints, cost, latency, and quality notes. | Keeps the system model-agnostic. |
| RAG architecture | Sources, embeddings, retrieval, metadata, freshness, and evaluation. | Protects answer quality. |
| Agent permission spec | Tools, allowed actions, approvals, blocked actions, logs, and escalation. | Controls automation risk. |
| Operations guide | Monitoring, support, rollback, incident path, model updates, and source refresh. | Makes runtime supportable. |
04
Procedures
The procedure is the product as much as the technology.
The goal is not to make AI look impressive for one meeting. The goal is to make the operating path repeatable, explainable, reviewable, and safe enough to improve.
- Do not choose local or cloud by ideology; choose by task, data, cost, and support need.
- Classify data before moving it into any model route.
- Separate retrieval from generation and action.
- Keep sensitive workflows close to approved environments.
- Use local models where custody, cost, offline access, or control matters.
- Use cloud APIs where capability, speed, or managed service fit the risk.
- Use hybrid routing when different tasks need different controls.
- Measure latency, cost, quality, support burden, and fallback behavior.
- Define who can approve model changes.
- Keep a model and route lifecycle record.
05
Controls
Governance, quality, and launch gates keep speed honest.
Folium keeps the buyer's next decision tied to observable gates: source truth, authority, access, testing, ownership, support, rollback, and improvement cadence.
| Gate | What must be true | Stop or refine signal |
|---|---|---|
| Data gate | Sensitivity, retention, and custody rules are known. | Data class is unknown. |
| Runtime gate | Each task has a justified route and fallback. | Everything goes to one model by default. |
| Cost gate | Expected usage, token/compute cost, and alerts exist. | Spend can grow silently. |
| Quality gate | Retrieval, generation, and task success are evaluated. | The route is chosen only by preference. |
| Support gate | Monitoring, updates, incidents, and rollback are owned. | No one can operate the runtime. |
06
Discovery questions
The right questions expose the real project.
These prompts help a buyer and Folium decide whether the next step should be education, audit, first build, security review, pilot, or an operating support path.
- What data cannot leave your environment?
- What tasks require the strongest model and which require a controlled local route?
- What latency is acceptable?
- What cost would become painful at scale?
- Who owns model updates and source refresh?
- Which tasks need RAG instead of general model memory?
- Which agent actions require human approval?
- What happens when the preferred model route fails?
07
Visual digestion
Diagrams, charts, and overlays make the work easier to review.
Dense AI work should not only be explained in paragraphs. The reviewer should be able to inspect maps, scorecards, matrices, lanes, and before-after views that reveal where the value and risk live.
Runtime router
A routing diagram from task class and data class to cloud, private, local, hybrid, or manual fallback.
- Task
- Data
- Route
- Fallback
RAG pipeline
A pipeline from source intake to chunking, metadata, embedding, retrieval, answer, and evaluation.
- Source
- Index
- Retrieve
- Evaluate
Cost-control chart
A chart comparing API token spend, local compute, support burden, and expected usage.
- Tokens
- Compute
- Support
- Scale
Model lifecycle board
A board for active, experimental, parked, retired, and replacement routes.
- Active
- Test
- Park
- Retire
08
Operating roles
Every serious AI path needs named owners before it becomes dependency.
The same technology can be safe or unsafe depending on who owns the workflow, data, quality, launch authority, support, and improvement loop. Folium makes those responsibilities explicit so no buyer inherits an orphaned system.
| Role | Owns | Record to inspect |
|---|---|---|
| Executive sponsor | Priority, budget, risk tolerance, stop/continue decision, and expansion timing. | Decision note, value hypothesis, and approval boundary. |
| Business process owner | The day-to-day work, acceptance criteria, staff impact, and operational usefulness. | Workflow map, user feedback, and adoption notes. |
| Technical owner | Systems, APIs, databases, runtime placement, deployment, monitoring, and fallback. | Architecture map, integration log, and support route. |
| Knowledge owner | Source truth, document freshness, policies, retrieval scope, and correction workflow. | Source inventory, freshness cadence, and review exceptions. |
| Security or risk reviewer | Data classes, credentials, access, logs, retention, blocked actions, and incident path. | Boundary map, permission table, and rollback trigger. |
| Folium delivery lead | Build coordination, review file, known limits, quality checks, and handoff completeness. | Launch room, eval record, and improvement backlog. |
09
Quality scorecard
A max-detail packet should tell reviewers how to judge the work.
Folium uses scorecards to make a subjective AI conversation more inspectable. The score is not a substitute for judgment; it helps leadership see whether the next step is education, repair, sandbox, pilot, or operations.
| Score area | Strong signal | Weak signal |
|---|---|---|
| Business fit | The workflow is specific, painful, owned, and tied to measurable operational improvement. | The project is framed as adding AI generally. |
| Source truth | Approved sources are known, fresh, classified, and connected to the answer path. | The system mixes stale, unknown, or unapproved sources. |
| Behavior quality | Representative tasks pass, wrong-answer behavior is known, and edge cases are recorded. | The review build only shows a polished happy path. |
| Authority control | AI actions are separated into draft, retrieve, recommend, route, execute, block, and escalate. | The system can act without visible permission. |
| Staff readiness | Users can explain the tool, correct it, escalate, and understand their role. | Staff feel replaced, confused, or unsupported. |
| Operations readiness | Support, monitoring, rollback, release rhythm, and source refresh are owned. | No one knows who maintains the system after launch. |
10
Thirty / sixty / ninety
The work should have a believable first ninety days.
A controlled first ninety days keeps ambition high without turning uncertainty into production risk. Folium uses the period to move from understanding into a narrow working example, then into reviewable operating rhythm.
| Window | Focus | Expected output |
|---|---|---|
| First 30 days | Discovery, source inventory, first-lane selection, staff interviews, data boundary, and build plan. | Process map, owner map, first-build scope, source list, and launch blockers. |
| Days 31-60 | Working surface, RAG or agent behavior, integration stub, evaluation cases, browser checks, and staff review. | Sandbox, evaluation file, screenshots, known limits, and repair list. |
| Days 61-90 | Architecture review, pilot conditions, governance layer, training guide, support path, and improvement cadence. | Launch room, go/no-go record, operations guide, and next-stage recommendation. |
11
Risk and assumption register
The hidden assumptions should be visible before they become expensive.
Every AI engagement contains assumptions about data, people, systems, cost, behavior, and authority. Folium treats those assumptions as review material, not background noise.
| Assumption | Why it matters | How Folium reviews it |
|---|---|---|
| The source is authoritative | AI can only be as reliable as the sources and business rules it is allowed to use. | Source inventory, owner confirmation, retrieval tests, freshness cadence. |
| The process is ready | A broken process can become a faster broken process when AI is added too early. | Workflow mapping, bottleneck review, owner interview, first-lane narrowing. |
| The runtime fits the data | Cloud, private, local, and hybrid routes carry different privacy, cost, latency, and support tradeoffs. | Runtime matrix, data classification, provider review, fallback plan. |
| Staff will adopt the tool | Adoption fails when users do not understand, trust, correct, or benefit from the system. | Training notes, staff review, feedback loop, manager visibility. |
| Authority is clear | The system can create harm if it sends, updates, approves, or routes without permission. | Permission table, blocked actions, human review, audit trail. |
| The system can be supported | A useful first build becomes fragile if nobody owns incidents, source updates, or cost review. | Support guide, owner map, release rhythm, rollback trigger. |
12
First sprint procedure
The first sprint should produce something real and reviewable.
Folium prefers a narrow first sprint that creates a working surface or review file the buyer can challenge. The first sprint is not the final system; it is the safest way to make the future visible.
- Confirm the single process and the decision the sprint must support.
- Collect approved example material, redacted review records, public references, screenshots, workflow notes, and source rules.
- Define what will be built: portal, dashboard, RAG assistant, agent route, integration adapter, audit file, or launch room.
- Create the visual workflow: intake, source, model or agent route, human review, output, record, and next gate.
- Run representative tasks, edge cases, bad input, missing data, and blocked-action tests.
- Prepare browser screenshots, known limits, support questions, and next-stage blockers.
- Review with staff and leadership before expanding data, access, authority, or dependency.
- End with a decision: stop, refine, rebuild, pilot, or prepare an operating plan.
13
Reference work products
The packet should make the invisible work tangible.
AI work often fails because the important pieces are invisible until something breaks. Folium turns those pieces into work products the buyer can open, print, challenge, and improve.
Process map
A before-and-after workflow showing people, systems, data, decision points, blockers, and expected output.
- Before
- After
- Owner
- Gate
Data boundary map
A map of source classes, approved use, blocked use, retention, provider exposure, and custody.
- Public
- Internal
- Private
- Blocked
Model and agent route
A path showing which model, tool, retrieval source, or agent lane is used and where humans approve.
- Route
- Tool
- Review
- Escalate
Evaluation file
A record of tasks, expected outcomes, failures, repairs, known limits, and acceptance criteria.
- Cases
- Failures
- Repairs
- Limits
Launch room
A board for owners, support, training, rollback, incidents, go/no-go, and improvement backlog.
- Owner
- Support
- Rollback
- Backlog
Handoff guide
A plain-language guide staff can use to understand what the system does, cannot do, and how to report problems.
- Use
- Limit
- Correct
- Report
14
Metrics and review rhythm
The business should know how improvement will be measured.
Folium keeps measurement practical. The first goal is not a perfect dashboard; it is a clear set of signals that shows whether the process is saving time, reducing risk, strengthening staff, or improving customer outcomes.
| Signal | What to watch | Decision it supports |
|---|---|---|
| Time recovered | Manual steps removed, average handling time, repeated work reduced, faster routing. | Should this workflow expand to more users or adjacent processes? |
| Quality improved | Wrong answers, missing sources, correction rate, review exceptions, customer rework. | Is behavior strong enough for pilot or does it need repair? |
| Risk reduced | Blocked unsafe actions, escalations, data-boundary violations avoided, rollback readiness. | Can authority expand or should controls remain tight? |
| Staff confidence | Training completion, feedback volume, adoption friction, override rate, manager notes. | Does the workforce need more support before launch? |
| Cost and runtime | Provider cost, local infrastructure cost, latency, uptime, fallback use, subscription sprawl. | Should runtime placement change? |
| Customer impact | Response speed, consistency, issue resolution, conversion support, satisfaction signals. | Is the capability improving the business outcome? |
15
Role review route
Each reviewer should know what to inspect first.
A max-detail packet is only useful when different reviewers can find their lane quickly. Folium separates executive, operations, technical, security, finance, and staff questions so the buyer can bring the right people into the right part of the review.
| Reviewer | Start with | Decision they support |
|---|---|---|
| Executive sponsor | Value hypothesis, launch gate, first ninety days, and stop/refine/continue choices. | Whether the process deserves a controlled engagement. |
| Operations lead | Workflow map, operating roles, support rhythm, and staff feedback loop. | Whether the future process can be run by the team. |
| Technical lead | Runtime placement, data path, integration surface, monitoring, and fallback. | Whether the architecture can be supported safely. |
| Security or risk reviewer | Data classes, permissions, blocked actions, logs, retention, and rollback. | Whether access can expand beyond public review. |
| Finance or owner | Cost signals, subscription overlap, runtime tradeoffs, labor impact, and support burden. | Whether the first build has a practical business case. |
| Staff user | Plain-language use, limits, escalation, correction path, and training expectations. | Whether the tool strengthens the job instead of confusing it. |
16
Buyer worksheet
The packet should turn into a working session, not only reading material.
Before a call, Folium wants the buyer to gather the real operating pieces that make the review useful. The worksheet keeps the conversation grounded in one process, one owner, one source map, and one next decision.
- Bring one workflow that is slow, risky, expensive, repetitive, customer-visible, or staff-heavy.
- Name the systems touched by the workflow: store, CRM, ERP, inbox, spreadsheet, database, portal, document folder, or legacy application.
- Separate approved public material from internal, customer, regulated, confidential, credential, and blocked material.
- Write down who owns the work today, who reviews exceptions, and who will own the AI-assisted version.
- List the decisions AI may draft, retrieve, recommend, route, block, or escalate, and the decisions that stay human-owned.
- Bring examples of good output, bad output, common exceptions, missing data, and customer-facing risk.
- Name the first useful working surface: dashboard, portal, assistant, queue, control room, commerce lane, integration, or review file.
- Decide what record would make leadership comfortable with the next stage.
17
Engagement fit ladder
The next step should match the maturity of the record.
Folium does not need every buyer to start at the same altitude. The right offer depends on how much process clarity, source truth, owner alignment, and launch readiness already exists.
| If the buyer has | Best next Folium move | Output to expect |
|---|---|---|
| AI interest but no clear process | AI systems audit or first workflow finder. | Pressure map, source inventory, first-lane recommendation, and risk view. |
| A clear process but no working surface | Forward engineering first sprint. | Clickable surface, route map, known limits, and next-stage blockers. |
| A tool that works in parts but not in operations | Architecture and launch readiness review. | Permission map, runtime decision, support model, and go/no-go record. |
| A failed or frightening rollout | AI recovery and staff enablement path. | Issue register, staff training plan, repair roadmap, and confidence loop. |
| Sensitive data or cost pressure | Local, private, or hybrid AI placement review. | Runtime matrix, data custody plan, fallback route, and vendor-exit view. |
| A useful pilot that needs care | AI operations support. | Monitoring rhythm, source refresh, release notes, incident path, and improvement backlog. |
18
Handoff
The last page of a packet should create the next controlled move.
Folium's handoff view separates what can be done now, what needs customer records, what needs approval, and what should wait until the review file is stronger.
| Handoff lane | Owner | Next record |
|---|---|---|
| Data owner | Security or operations | Data custody and sensitivity map. |
| Runtime owner | Technical lead | Runtime placement and deployment guide. |
| Quality owner | Folium and customer reviewers | Evaluation results and known limits. |
| Operations owner | Support lead | Monitoring, source refresh, and model lifecycle record. |
The strongest next step is narrow: one process, one owner, one source map, one working surface, one review file, and one decision gate.
19
Next step
Private AI is not a slogan. It is a placement decision with operating consequences.
Use this guide to decide which workloads should use cloud, private, local, hybrid, RAG, agent, or manual fallback paths.
Bring the process
Name the business process, the systems involved, the people affected, and the decision this PDF should support.
Separate review from production
Keep public examples, sandbox review, pilot access, and production dependency in separate stages with clear owners.
Ask for the record
Request screenshots, browser checks, known limits, launch blockers, support plans, and the next approval path.