ZhenIns Public Whitepaper
Version: 1.1
Published: 19 May 2026
Applicable to: Policyholders, insurance advisors, brokerage firms, and ecosystem partners
Previous version: v1.0 (2026-05-10)
Change Summary (v1.0 → v1.1)
- Unified Identity & SSO (ADR-009): Consumer SMS authentication and advisor OAuth login both connect to the
auth.zhenrobot.comunified identity center, enabling cross-platform identity recognition and passwordless recovery. - Consumer Session Persistence: Full CRUD support for conversation history (create, read, rename, delete). Session data is no longer limited to browser local storage.
- Precise Token Counting: Introduced
tiktokenfor per-token metering, replacing traditional character-length estimation. Provides accurate context-budget management for complex insurance reasoning. - Consent Full-Cycle Closure: From AI suggestion trigger → profile authorization confirmation → advisor receipt → consumer revocation, the entire flow is stateful, auditable, and revocable.
- Frontend State Race Protection: Chat session layer and business-state layer are decoupled, eliminating duplicate loading and quota-state drift during mount/unmount cycles.
- End-to-End Test Full Coverage: lint + build + E2E + backend integration test + production smoke 5-step curl all纳入 CI gate.
1. Overview
1.1 What is ZhenIns
ZhenIns is the AI-powered insurance consulting frontstage of the Zhen ecosystem. It serves policyholders through a chat-first interface for coverage-gap analysis, plan recommendations, and document preparation guidance. When a conversation touches a complex scenario—such as replacing an existing policy, conflicting health disclosures, severe budget constraints, or special-risk populations—the system automatically routes the case to a licensed insurance advisor for human review via the Agent2Human protocol.
ZhenIns does not sell insurance products directly. Its role is to help clients answer three questions:
- Where are my coverage gaps?
- Is my current plan sufficient?
- What should I do next, and what documents should I prepare?
1.2 Who We Serve
| Role | Core Need | Service Form |
|---|---|---|
| Policyholders (primary users) | Understand gaps, get recommendations, prepare documents | AI chat + human fallback + service packages |
| Insurance Advisors | Receive complex referrals, communicate securely, manage leads | Agent2Human workflow + advisor-workspace subscription |
| Brokerage Firms / Enterprises | Multi-seat management, compliance reports, API integration | Brokerage / Enterprise plans |
| Ecosystem Partners | Embed AI insurance consulting into third-party workflows | OpenClaw Skill licensing |
1.3 Core Commitments
- AI first, human fallback: AI handles standard cases; complex cases are escalated automatically or on demand. The client always has a choice.
- Data minimization: Full sensitive data is not collected before explicit authorization; every data request states its purpose; users may revoke authorization at any time.
- Compliance audit trail: Consultation requests, recommendation outputs, human-review nodes, and authorization records are all logged and auditable.
- No final-conclusion pretense: AI outputs are recommendation summaries, not formal underwriting conclusions. Final coverage, claims, and compliance determinations are executed by licensed insurers.
2. Position in the Zhen Ecosystem
2.1 Business Frontend Shell
ZhenIns is a Business Frontend Shell in the Zhen ecosystem. It is responsible for public client acquisition, consultation interaction, plan explanation, authorization documentation, and the compliance whitepaper. It does not maintain a standalone identity system, billing platform, or organization backend.
The shell architecture ensures that insurance-grade compliance, audit, and data-minimization requirements are met centrally, not rebuilt in every vertical site.
2.2 Relationship to zhen-brain-core
zhen-brain-core handles insurance reasoning, plan generation, risk assessment, and memory.
Critical constraint: ZhenIns must not call zhen-brain-core directly in production. All requests are forwarded through the zhen-platform-core gateway. The fixed execution chain is:
ZhenIns Frontend → zhen-platform-core → zhen-brain-core
This constraint ensures identity verification, permission checks, usage auditing, and compliance logging are enforced at the gateway layer, preventing direct exposure of reasoning interfaces from the frontstage.
2.3 Relationship to zhen-platform-core
zhen-platform-core is the unified operating kernel of the Zhen ecosystem, responsible for:
- Identity & Organization: consumer SMS authentication, advisor OAuth login, workspace and organization management
- Billing & Entitlements: membership subscriptions, orders, refunds, invoices
- Payments: unified checkout (Alipay / WeChat Pay), with the pricing catalog as the single source of truth
- Audit Logging: session metadata, authorization records, compliance tags
ZhenIns does not maintain a second identity or billing system locally. It consumes platform-core services via API and surfaces results to users.
2.4 Relationship to zhen-platform-console
zhen-platform-console is the unified post-login control plane.
ZhenIns retains only transitional entry points for logged-in management; long-term ownership belongs to the console. The following actions redirect to the console:
- Viewing orders and entitlements
- Managing API Keys
- Team seats and organization settings
- Downloading compliance reports and audit files
- Subscription management and invoices
The frontstage's responsibility is to guide users to the correct console entry point, not to rebuild the control plane inside the site.
2.5 Why This Division is Necessary
The insurance industry has far stricter regulatory requirements than typical SaaS:
- Identity authenticity: Insurance transactions are bound to verified identity; anonymous sessions are insufficient.
- Authorization traceability: Any data collection and processing must have explicit authorization records.
- Data minimization: The cloud does not hold raw client data by default; uploads are authorization-driven.
- Centralized risk control: Reasoning, assessment, and sandbox capabilities are governed by a unified brain to avoid inconsistent interpretations across frontstages.
Centralizing identity, billing, risk control, and audit is the minimum viable architecture to meet these requirements.
3. Agent2Human Protocol
3.1 Protocol Definition
Agent2Human is ZhenIns's collaboration protocol. Its core principle: AI handles standard cases; human advisors handle complex cases.
The AI advisor continuously assesses conversation complexity. When complexity exceeds a safe threshold, or when the client explicitly requests human assistance, the system automatically transfers a de-sanitized summary of the conversation context to a licensed insurance advisor.
3.2 Automatic Escalation Triggers
The following scenarios trigger automatic human handoff:
- Complex health disclosure: Prior conditions, multiple hospital records, non-standard underwriting status
- Existing policy replacement: Surrender-loss calculations, waiting-period overlap, liability conflicts
- Liability conflicts: Duplicate coverage or gap hedging across multiple family policies
- Severe budget constraints: Budget and coverage needs are too far apart, requiring trade-off analysis
- Special populations: High-risk occupations, non-standard health, high-net-worth individuals, foreign nationals, complex family structures
3.3 Client-Initiated Escalation
Clients may request human review at any stage, for example:
"I'd like to speak with a professional insurance advisor." "I'm not sure about this recommendation." "Can someone confirm this for me?"
Client-initiated escalation bypasses AI complexity assessment and enters the human queue directly.
3.4 Advisor Qualifications
Advisors receiving referrals must meet:
- Valid professional license
- Minimum 2 years of experience
- Quarterly platform re-certification
- Full-process audit logging
Service standards:
| Metric | Commitment |
|---|---|
| Acceptance response | ≤ 2 hours |
| First contact | ≤ 4 hours |
| Full-process logging | 100% |
| Client evaluation | Reviewable, complaint-eligible, refundable |
The advisor side is equipped with a multi-channel notification system—on-site notifications, Web Push, SMS, and Email—ensuring high-intent leads (score ≥ 7) reach advisors within 15 seconds.
3.5 Data De-sanitization Boundary
AI conversations may collect complete data (ID numbers, detailed health records, asset details). When transferring to a human advisor, only a de-sanitized summary is shared:
- Age bracket / household structure / budget / occupation type
- Coverage-gap summary
- Complex-point description (de-sanitized)
- Client request
Not shared:
- Complete AI conversation history
- Detailed client identity (before authorization)
- AI internal assessment logic
- Complete household asset details
If the human advisor requires full data, they must issue a secondary authorization request to the client. The data is only released after explicit client consent. Clients may revoke consent at any stage; upon revocation, the advisor workstation immediately loses profile access via real-time WS.
3.6 Consent Full-Cycle
Handoff Consent is the data-authorization core of the Agent2Human protocol, forming a complete closed loop:
- Trigger: AI identifies complex needs during conversation, returning a
suggested_handoff: truemarker - Authorization confirmation: Frontend pops a Consent Modal, listing profile items to be shared (age, city, household, budget, health, etc.), with sensitive fields marked
- Advisor receipt: Advisor workstation displays real-time leads with AI-extracted structured profiles and conversation summaries
- Revocation: Consumers may revoke consent at any time from "My Service Progress"; advisor side receives a
lead_revokedWS event in real time, and profile access is immediately revoked - Audit: Authorization, sharing, and revocation are fully logged to the
notificationstable andlead_eventslog
3.7 Quality Assurance
- Quarterly review of advisor licenses
- Monthly review of service scores and complaint rates
- Full conversation logging for compliance auditing
- Refund or advisor replacement available on dissatisfaction
- Platform outbound E2E tests covering the full consumer journey, advisor workspace, payment orchestration, and authorization chain (32+ test cases)
4. Data Governance and Compliance
4.1 Minimization and Tiered Authorization
ZhenIns follows the Personal Information Protection Law requirements, applying data minimization and tiered authorization:
| Tier | Content | Default State |
|---|---|---|
| Basic | De-sanitized summary (age bracket, household, budget range, occupation) | Enabled by default |
| Detailed | Contact info, health disclosure, existing policies, financial information | Requires explicit client consent |
| Revocable | Client may withdraw granted permissions at any time | Always available |
Hard rules:
- Full sensitive data may not be requested before explicit authorization.
- "Upload entry exists" may not be stated as "system has received official documents."
- "Recommendation summary" may not be stated as "final coverage result."
4.2 Data Holdings Boundary
| Data Type | Location | Note |
|---|---|---|
| Advisor client list, communication memos, unsubmitted draft plans | ZhenIns local / advisor side | Temporary content created during service |
| Authorized quotes / underwriting summaries | zhen-platform-core | Formal business data after client authorization |
| Session metadata, compliance logs, audit exports | zhen-platform-core | Supports cross-platform audit and compliance checks |
| Membership / orders / entitlements / invoices | zhen-platform-core | Held by unified operating kernel |
Key principle: The cloud does not hold raw client data by default. All uploads are authorization-driven and necessary for business purposes.
4.3 Transport and Storage Security
- Transport: TLS 1.3 full-path encryption
- Storage: AES-256 encryption for sensitive data
- Audit: Periodic security audits and penetration testing
- Logging: Access logs retained in accordance with regulatory requirements
4.4 Compliance Logging and Audit
The following events are mandatorily logged:
- Time, scenario, and user identifier (de-sanitized) of each consultation request
- AI recommendation summary output
- Agent2Human trigger reason and human review outcome
- Client authorization and withdrawal records
- Payment order status changes
Log data supports compliance audit exports and is managed centrally by zhen-platform-core.
5. Technical Architecture
5.1 Three-Layer Execution Chain
ZhenIns's production execution chain is frozen as a three-layer structure:
Business Frontend (ZhenIns)
→ zhen-platform-core (unified gateway)
→ zhen-brain-core (insurance reasoning and risk control)
Frontend responsibilities: initiate business requests, display status, surface results, render conversations.
Gateway responsibilities: identity verification, permission validation, usage auditing, compliance logging.
Brain responsibilities: insurance reasoning, plan generation, risk assessment, memory retention.
5.2 SSE Real-Time Streaming Consultation
The frontend uses Server-Sent Events (SSE) for real-time AI chat streaming. After a user sends a message, the AI generates responses token-by-token without waiting for the full response.
Technical parameters:
- First-token latency: 3–6 seconds (influenced by external inference API queuing)
- Token buffer: 30ms timeout / ≥8 character flush
- Frontend typewriter effect: 75ms per CJK character, 22ms per Latin word
- Burst smoothing: 25ms backend smoothing to prevent frontend stutter
- Precise token metering: Introduced
tiktoken+ CJK-aware fallback, replacinglen(text)withestimate_tokens()across the entire stack to ensure accurate context budgets for complex insurance scenarios
5.3 Payment Flow
Payments are progressively centralized to the zhen-platform-core unified checkout. New order flow:
User confirms service package → platform-core generates order → Alipay / WeChat Pay
→ Payment success → entitlement sync → ZhenIns receipt page
The local payment system (previously direct Alipay / WeChat SDK) was formally retired on 2026-05-16; /billing/recharge and legacy webhook endpoints return 410 Gone. The frontstage retains only payment orchestration pages, result handling, and receipt logging.
5.4 Scoped API Key and OpenClaw Skill
- Scoped API Key: Cross-platform access uses unified identity + unified billing + scope-limited permission keys, avoiding universal-key risks
- OpenClaw Skill: Exposes AI insurance consulting capabilities via standardized interfaces to third-party workflows, supporting recognition of human-review nodes and de-sanitized handoff in local AI environments.
insurance-broker@2.0.2has been published to ClawHub with a singleproxyaction and whitelist-protected endpoints
5.5 Frontend State Layering and Race Protection
The Chat frontend has completed a state-layering refactor, decoupling session state (conversations/messages/streaming) from business state (alert/consumer/handoff):
chat-session-context.tsx: Pure session container responsible for CRUD, SSE consumption, and loading stateschat-business-context.tsx: Business-state container responsible for quotas, errors, and handoff suggestionsalert-reducer.ts: Action-Based guard with a unified entry point for state changes
Key protection: mount-phase loadConversations is guarded by hasInitialLoadedRef to prevent duplicate execution; clearQuotaState(reason) provides a unified entry point to eliminate race conditions.
6. Business Model
6.1 Client Side
| Service | Price | Note |
|---|---|---|
| Basic AI consultation | Free | Coverage-gap analysis, plan direction recommendations |
| Deep plan customization | Paid | Personalized detailed protection design |
| Multi-plan comparison | Paid | Horizontal comparison of 2–3 plans |
| Policy health check | Paid | Review of existing policies and gap diagnosis |
| Advisor human review | Per service package | Complex-scenario handoff to licensed advisors |
Consumer conversation quotas are managed by the zhen-platform-core entitlement system; the local free_messages_used field has been retired.
6.2 Advisor Side
| Product | Form | Includes |
|---|---|---|
zhenins.advisor.pro | Personal monthly membership | Advisor workspace, Agent2Human skill nodes, client management |
zhenins.agency.seat | Team seat | Multi-advisor collaboration, organization permissions, unified settlement |
Advisor login is unified through OAuth (auth.zhenrobot.com); the local password system was formally retired on 2026-05-07. The advisor workspace includes real-time WS push, lead pools, multi-dimensional filtering, performance rankings, and commission statistics.
6.3 Enterprise Side
| Product | Form | Target |
|---|---|---|
| Brokerage | Annual fee | Small-to-medium insurance brokerages |
| Enterprise | Annual fee + implementation fee | Large organizations, with on-premise deployment options |
Enterprise plans include multi-seat management, organization permissions, API Key integration, compliance reports, and audit exports.
6.4 Pricing Centralization
Product pricing (advisor memberships, team seats, enterprise plans) is no longer maintained locally by ZhenIns. The zhen-platform-core catalog serves as the single source of truth. The frontstage is responsible only for display and purchase guidance.
7. Roadmap and Milestones
7.1 Completed
| Project | Description | Completed |
|---|---|---|
| Consumer SMS authentication (FIC-01) | Device recovery, upgrade merge, SMS verification login | May 2026 |
| Advisor OAuth unified login (FIC-02) | Retired local passwords, integrated unified authentication | May 2026 |
| Cookie-only sessions | Removed localStorage token dependency, switched to HttpOnly cookies | May 2026 |
| Payment checkout centralization (ADR-008 Phase 1 & 3) | Unified checkout via platform-core; local payment system retired | May 2026 |
| Consumer conversation CRUD | Full GET/POST/PATCH/DELETE API, frontend sidebar support | May 2026 |
| Consumer SSO (ADR-009) | auth.zhenrobot.com SMS login + platform-core identity sync | May 2026 |
| Precise token counting | tiktoken + CJK-aware fallback, full-stack replacement | May 2026 |
| Handoff Consent full-cycle | Authorization → receipt → revocation → WS real-time sync | May 2026 |
| Advisor multi-channel notifications (A+B+C+D) | On-site + Web Push + SMS + Email, end-to-end verified | May 2026 |
| Frontend state race protection | ChatContext layered refactor + Action-Based guard + E2E persistence assertions | May 2026 |
| Design-system upgrade | Full typography, animation, and interaction refresh | May 2026 |
7.2 In Progress
| Project | Description |
|---|---|
| SSE performance and timeout optimization | Mitigating occasional first-token latency in complex insurance queries |
| Generative Engine Optimization (GEO) | Multi-version permanent whitepaper URLs, Schema.org structured data, RSS feed |
7.3 Planned
Feature expansion is driven by the Zhen ecosystem roadmap and is not released independently by ZhenIns. The frontstage maintains its "shell" positioning; new capabilities are inherited naturally through platform-core and brain-core upgrades.
Appendix A: Glossary
| Term | Client-facing Explanation |
|---|---|
| Agent2Human | Collaboration protocol: AI handles standard cases; complex cases are automatically or manually transferred to human insurance advisors |
| Platform Console | Unified management center after login, for orders, entitlements, and team settings |
| Brain-Core | The ecosystem's AI insurance reasoning system; ZhenIns connects indirectly through a gateway |
| Entitlement | Rights and permissions that determine which features and services you can use |
| SSE | Real-time streaming conversation technology that makes AI responses appear word-by-word |
| Scoped API Key | Permission-scoped access key for secure third-party integration |
| Consent | Authorization agreement controlling when and to what extent your profile is shared with advisors |
| Token | The smallest unit of text processed by AI; precise counting ensures complex queries stay within budget |
Appendix B: Compliance Document Index
- Authorization
- Privacy Policy
- Terms of Service
- Security & Compliance
- Documentation Center
- Download Center
This document is the ZhenIns public whitepaper v1.1. Detailed technical specifications, ecosystem master documents, and internal interface definitions are maintained in upstream architecture documents and are not superseded by this whitepaper. Historical versions are permanently accessible: /whitepaper/v/v1.0.