EnterpriseHub

Executive Summary

Real estate teams lose 40% of leads when response time exceeds the 5-minute SLA. EnterpriseHub automates lead qualification, follow-up scheduling, and CRM sync across three specialized AI bots — so no lead goes cold. Built for real estate brokerages and agencies; production-validated with 7,678 collectible tests and a full observability stack.

Business Impact

EnterpriseHub delivers quantified outcomes based on production deployment (Case Study CS001):

Outcome	Result	How Measured
95% Faster Lead Response	45 min → 2 min qualification	Time from lead submission to Q0-Q4 score
$240K Annual Savings	Automated qualification vs. manual review	Agent hourly rate × hours saved × annual volume
133% Conversion Increase	12% → 28% lead-to-appointment rate	Qualified leads converted to appointments/closed deals
89% Token Cost Reduction	93K → 7.8K tokens per workflow	Token usage before/after 3-tier cache
88% Cache Hit Rate	L1 59% + L2 21% + L3 8%	Validated Feb 11, 2026
92% Qualification Accuracy	Q0-Q4 framework correctness	Validated Feb 11, 2026
3x Agent Productivity	Agents focus on high-value prospects	45 min → 2 min per lead

See CASE_STUDY.md and BENCHMARK_VALIDATION_REPORT.md for methodology.

Architecture

graph TB
    subgraph Clients["Client Layer"]
        LB["Lead Bot :8001"]
        SB["Seller Bot :8002"]
        BB["Buyer Bot :8003"]
        BI["Streamlit BI Dashboard :8501"]
    end

    subgraph Core["FastAPI Core — Orchestration Layer"]
        CO["Claude Orchestrator<br/><small>Multi-strategy parsing, L1/L2/L3 cache</small>"]
        AMC["Agent Mesh Coordinator<br/><small>22 agents, capability routing, audit trails</small>"]
        HO["Handoff Service<br/><small>0.7 confidence, circular prevention</small>"]
    end

    subgraph CRM["CRM Integration"]
        GHL["GoHighLevel<br/><small>Webhooks, Contact Sync, Workflows</small>"]
        HS["HubSpot Adapter"]
        SF["Salesforce Adapter"]
    end

    subgraph AI["AI Services"]
        CL["Claude<br/><small>Primary LLM</small>"]
        GM["Gemini<br/><small>Analysis</small>"]
        PP["Perplexity<br/><small>Research</small>"]
        OR["OpenRouter<br/><small>Fallback</small>"]
    end

    subgraph RAG["Advanced RAG System"]
        BM25["BM25 Sparse Search"]
        DE["Dense Embeddings"]
        RRF["Reciprocal Rank Fusion"]
        VS["ChromaDB Vector Store"]
    end

    subgraph Data["Data Layer"]
        PG[("PostgreSQL<br/><small>Leads, Properties, Analytics</small>")]
        RD[("Redis<br/><small>L2 Cache, Sessions, Rate Limiting</small>")]
    end

    LB & SB & BB -->|"Qualification<br/>Requests"| Core
    BI -->|"Analytics<br/>Queries"| Core
    Core -->|"CRM Sync"| CRM
    CO -->|"LLM Calls"| AI
    CO -->|"Retrieval"| RAG
    Core -->|"Read/Write"| Data
    RAG --> VS
    HO -->|"Bot Transfer"| Clients

Live Demo


Dashboard	https://ct-enterprise-ai.streamlit.app
API Docs	Swagger UI (40+ routes, available on local/staging deploy)
Demo login	`demo_user` / `Demo1234!`
Admin login	`admin` / `Admin1234!`

These credentials are for the public demo instance only. They access synthetic data and are not production secrets.

Deploying your own instance? See Deployment below. Run python scripts/seed_demo.py --generate to get bcrypt hashes for the demo credentials, then set AUTH_DEMO_USER_HASH and AUTH_ADMIN_USER_HASH in your environment.

Architecture Tour: 5 Systems Worth Reviewing

A guide for technical reviewers with 5 minutes. Each entry names the file, explains the problem it solves, and points to the key pattern.

1. Agent Mesh Coordinator

File: ghl_real_estate_ai/services/agent_mesh_coordinator.py (725 lines)

Problem: When multiple AI agents (lead qualification, property matching, CMA generation, document processing) run concurrently, you need a governance layer that prevents runaway costs, enforces SLAs, and routes tasks to the cheapest capable agent.

Key classes: AgentMeshCoordinator, MeshAgent, AgentTask, AgentMetrics

Pattern: Each agent registers with a cost_per_token and sla_response_time. Task routing uses a weighted scoring function across four dimensions: success rate (40%), current load (25%), cost efficiency (20%), and average response time (15%). Emergency tasks get a 1.5x score multiplier. Four background coroutines run continuously: health monitor (30s heartbeat), cost monitor (5min), performance monitor (2min), and cleanup. If hourly spend crosses $50, mesh activity is throttled; at $100, emergency_shutdown() cancels all active tasks and sets every agent to MAINTENANCE.

Outcome: 22 registered agents across the platform with per-agent P50/P95 tracking and automatic load rebalancing when queue time exceeds 30 seconds.

Training foundation: Microsoft AI & ML Engineering (75h) — agent orchestration patterns, SLA-based routing, performance monitoring.

2. 3-Tier LLM Cache

Files: ghl_real_estate_ai/services/claude_orchestrator.py (1,935 lines), ghl_real_estate_ai/services/cache_service.py, ADR: docs/adr/0001-three-tier-redis-caching.md

Problem: A single lead qualification workflow without caching consumes ~93K tokens. With hundreds of concurrent conversations referencing the same property data and market context, the cost compounds quickly.

Pattern:

L1 (in-memory LRU): MemoryCache with 1,000-item capacity and LRU eviction. Sub-1ms access. Handles repeated lookups within the same active qualification session.
L2 (Redis): Shared across all FastAPI workers. Under 5ms access. Default 15-minute TTL for conversation context, 1 hour for market data. Handles cross-request deduplication.
L3 (PostgreSQL): Persistent, under 20ms access. Stores historical results for analytics and A/B comparisons. Cache keys incorporate conversation_id + message_hash + model_version to prevent stale reads after model upgrades.

A background task promotes frequently accessed L1 keys to L2.

Outcome: 89% token cost reduction (93K to 7.8K tokens per workflow); 88% overall hit rate (L1 59% + L2 21% + L3 8%). P95 latency for cached queries drops from 800ms to under 200ms.

Training foundation: Duke LLMOps (48h) — multi-tier caching, cost optimization, token budgeting. IBM GenAI Engineering (144h) — LangChain orchestration, model strategy patterns.

3. Compliance Response Pipeline

Files: ghl_real_estate_ai/services/jorge/response_pipeline/pipeline.py (78 lines), ghl_real_estate_ai/services/jorge/response_pipeline/factory.py

Problem: Every outbound bot message must pass through TCPA opt-out detection, FHA/RESPA compliance, AI disclosure rules, language mirroring, and SMS length constraints before it leaves the system. These are independent concerns that fail differently.

Pattern: ResponsePostProcessor chains ResponseProcessorStage instances. Each stage receives a ProcessedResponse and returns one with an updated action. If any stage sets ProcessingAction.SHORT_CIRCUIT, the remaining stages are skipped. The default pipeline (created by create_default_pipeline()) runs 7 stages in order:

LanguageMirrorProcessor — detects contact language, sets context.detected_language
TCPAOptOutProcessor — pattern-matches opt-out phrases, short-circuits with acknowledgment, applies TCPA-Opt-Out and AI-Off GHL tags
ConversationRepairProcessor — detects conversation breakdown, graduated repair ladder
ComplianceCheckProcessor — FHA/RESPA enforcement via ComplianceMiddleware.enforce(), replaces blocked response with a safe fallback
AIDisclosureProcessor — no-op stub; disclosure triggers only when a lead explicitly asks
ResponseTranslationProcessor — mirrors user language for fixed qualification and scheduling messages
SMSTruncationProcessor — enforces 320-character SMS limit, truncates at sentence boundaries

Outcome: Every bot message is compliance-checked before delivery. Stage failures are caught per-stage and logged without dropping the message.

Training foundation: IBM RAG & Agentic AI (24h) — agentic pipeline design, safety constraints. Vanderbilt Generative AI Strategic Leader (40h) — responsible agent behavior patterns.

4. Cross-Bot Handoff with Performance Routing

Files: ghl_real_estate_ai/services/jorge/jorge_handoff_service.py (1,660 lines), ghl_real_estate_ai/services/jorge/handoff_router.py

Problem: A lead who starts with the Lead Bot and reveals buyer or seller intent needs to transfer to the right specialist bot without losing conversation context or creating infinite handoff loops.

Key classes: JorgeHandoffService, HandoffDecision, EnrichedHandoffContext, HandoffRouter

Pattern:

Confidence thresholds per direction: Lead-to-Buyer/Seller at 0.7; Buyer-to-Seller at 0.8; Seller-to-Buyer at 0.6
Circular prevention: Same source-to-target pair is blocked within a 30-minute window
Rate limiting: 3 handoffs per hour, 10 per day per contact
Pattern learning: JorgeHandoffService adjusts thresholds dynamically after at least 10 outcome data points per route (MIN_LEARNING_SAMPLES = 10)
Performance routing: HandoffRouter.should_defer_handoff() defers the transfer when target bot P95 exceeds 120% of its SLA or error rate exceeds 10%. Deferred handoffs retry after a 30-minute cooldown with a maximum of 3 attempts.

The EnrichedHandoffContext dataclass carries qualification score, budget range, CMA summary, and urgency level so the receiving bot can skip re-qualification.

Outcome: blocked_by_performance and blocked_by_circular tracked as named analytics fields. Handoff success rate and processing time are available via get_analytics_summary().

Training foundation: Microsoft AI & ML Engineering (75h) — confidence scoring, performance routing. IBM GenAI Engineering (144h) — conversation context design, multi-agent coordination.

5. A/B Testing Service

File: ghl_real_estate_ai/services/jorge/ab_testing_service.py (849 lines)

Problem: Comparing bot prompt variants or response tone strategies without deterministic assignment produces inconsistent experiences: the same contact could see different variants across sessions.

Key classes: ABTestingService (singleton), VariantStats, ExperimentResult, StatisticalAnalyzer (in ab_testing_framework.py)

Pattern: Variant assignment hashes experiment_id + contact_id (SHA-256) and maps the result to a bucket. The same contact always gets the same variant. Significance is evaluated with a two-proportion z-test: StatisticalAnalyzer.calculate_statistical_significance() computes a pooled standard error and z-score, then approximates a two-tailed p-value using math.erf. Minimum sample size is calculated before an experiment starts using configurable significance_level (default 0.05) and statistical_power (default 0.8). Four pre-built experiment identifiers cover response tone, follow-up timing, CTA style, and greeting style. An optional ABTestingRepository provides write-through PostgreSQL persistence without blocking the in-memory caller.

Outcome: Experiments run with no risk of variant drift per contact. Results surface is_significant, p_value, and winner in ExperimentResult.

Training foundation: Duke LLMOps (48h) — model A/B testing with statistical significance, prompt variant evaluation. Google Advanced Data Analytics (200h) — z-test methodology, power analysis.

→ Full cert-to-code mapping: docs/certifications.md (1,398h across 15 certifications)

Screenshots (Live Demo)

Executive Command Center	Lead Intelligence

3-Tier Cache Performance — 89% token cost reduction (93K → 7.8K tokens/workflow)

Tech Stack

Layer	Technology
API	FastAPI (async), Pydantic validation
UI	Streamlit, Plotly
Database	PostgreSQL, Alembic migrations
Cache	Application memory (L1), Redis (L2), PostgreSQL (L3)
AI/ML	Claude (primary), Gemini (analysis), OpenRouter (fallback)
CRM	GoHighLevel (webhooks, contacts, workflows)
Search	ChromaDB vector store, BM25, hybrid retrieval
Payments	Stripe (subscriptions, webhooks)
Infrastructure	Docker Compose

Security

CI runs security scanning (bandit, pip-audit, SQL injection grep) on every push.

Parameterized SQL — all queries use parameterized text() or asyncpg $1 bindings. DDL identifiers validated and double-quoted via utils.sql_safety.quote_identifier(). CI gate rejects any unprotected f-string SQL patterns.
Webhook authentication — Router-level require_ghl_webhook_signature dependency enforces Ed25519 or HMAC-SHA256 signature verification on all GHL webhook routes. Replay protection via X-GHL-Timestamp with 5-minute window.
JWT authentication — 1-hour expiry tokens validated on every protected route
PII encryption — contact data encrypted at rest using Fernet symmetric encryption
Input validation — Pydantic V2 models enforce strict types on all API boundaries
Rate limiting — Redis-backed sliding window: 100 req/min per IP, 200 burst
Compliance pipeline — 7-stage response processing enforces FHA, RESPA, TCPA, CCPA, and SB-243

See .github/workflows/security-scan.yml for the full pipeline.

Architecture Decisions

ADR	Title	Status
ADR-0001	Three-Tier Redis Caching Strategy	Accepted
ADR-0002	Multi-CRM Protocol Pattern	Accepted
ADR-0003	Jorge Handoff Architecture	Accepted
ADR-0004	Agent Mesh Coordinator	Accepted
ADR-0005	Pydantic V2 Migration	Accepted
ADR-0006	Security Framework Consolidation	Accepted
ADR-0007	7-Stage Compliance Response Pipeline	Accepted
ADR-0008	Multi-LLM Orchestration Strategy	Accepted
ADR-0009	Dual-Mode Webhook Signature Verification	Accepted
ADR-0010	Structured Logging with structlog	Accepted

Project Structure

EnterpriseHub/
├── ghl_real_estate_ai/           # Main application
│   ├── agents/                   # Bot implementations (Lead, Buyer, Seller)
│   ├── api/routes/               # FastAPI endpoints
│   ├── services/                 # Business logic layer
│   │   ├── claude_orchestrator.py    # Multi-LLM coordination + caching
│   │   ├── agent_mesh_coordinator.py # Agent fleet management
│   │   ├── llm_observability.py      # LLM cost tracking + tracing
│   │   ├── enhanced_ghl_client.py    # CRM integration (rate-limited)
│   │   └── jorge/                    # Bot services (handoff, A/B, metrics)
│   ├── models/                   # SQLAlchemy models, Pydantic schemas
│   └── streamlit_demo/           # Dashboard UI components
├── advanced_rag_system/          # RAG pipeline (BM25, dense search, ChromaDB)
├── benchmarks/                   # Synthetic performance benchmarks
├── docs/                         # Documentation
│   ├── adr/                      # Architecture Decision Records
│   └── templates/                # Reusable templates for other repos
├── tests/                        # 7,678 tests collectible (unit + integration + security)
├── conftest.py                   # Shared test fixtures
├── render.yaml                   # Render deployment config
└── docker-compose.yml            # Container orchestration

Deployment

Full deployment with PostgreSQL, Redis, migrations, and demo data using Docker Compose.

Prerequisites: Docker and Docker Compose.

git clone https://github.com/ChunkyTortoise/EnterpriseHub.git
cd EnterpriseHub

# One command does everything:
#   1. Starts PostgreSQL 15 + Redis 7 containers
#   2. Waits for Postgres health check (pg_isready)
#   3. Runs Alembic database migrations
#   4. Seeds demo data (scripts/seed_demo_environment.py)
#   5. Starts all application containers
./setup.sh

After setup completes:

Service	URL
Streamlit BI Dashboard	http://localhost:8501
FastAPI Backend	http://localhost:8000 (with `--profile api`)
PostgreSQL	`localhost:5432`
Redis	`localhost:6379`

# Quick start (demo mode — no API keys, no database)
make demo

# Stop all services
docker compose down

# View logs
docker compose logs -f

# Run tests
pytest --tb=short

Monitoring

Capability	Implementation	Key Metric
Token Cost Optimization	3-tier cache (L1 memory, L2 Redis, L3 PostgreSQL) + model routing	93K → 7.8K tokens/workflow (89% reduction)
Latency Monitoring	`PerformanceTracker` — P50/P95/P99 percentiles, SLA compliance	Lead Bot P95 < 2,000ms
Alerting	`AlertingService` — 7 default rules, configurable cooldowns	Error rate, latency, cache, handoff, tokens
Per-Bot Metrics	`BotMetricsCollector` — throughput, cache hits, error categorization	87% cache hit rate
Health Checks	`/health/aggregate` endpoint checks all services	Bot + DB + Redis + CRM status

See docs/OBSERVABILITY.md and BENCHMARKS.md for details.

Related Projects

jorge_real_estate_bots — Three-bot lead qualification system (Lead, Buyer, Seller) - live production
docextract — Production RAG pipeline: PDF upload, async processing, pgvector hybrid search, citation-aware answers
mcp-server-toolkit — 9 MCP servers for LLM tool integration, published to PyPI

Contributing

See CONTRIBUTING.md for development setup, PR guidelines, and code standards.

See CHANGELOG.md for release history.

python -m pytest tests/ -v
python -m pytest --cov=ghl_real_estate_ai --cov-report=term-missing
python -m benchmarks.run_all

License

MIT — see LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 1,119 Commits
.beads		.beads
.claude		.claude
.debug_chroma/bdbb8f87-4959-4f05-9562-c3f1abd703a8		.debug_chroma/bdbb8f87-4959-4f05-9562-c3f1abd703a8
.devcontainer		.devcontainer
.gemini		.gemini
.github		.github
.playwright-mcp		.playwright-mcp
.serena		.serena
.streamlit		.streamlit
advanced_rag_system		advanced_rag_system
agentforge		agentforge
ai-devops-suite		ai-devops-suite
alembic		alembic
api-docs		api-docs
assets		assets
auto-claude		auto-claude
backend		backend
benchmarks		benchmarks
billing		billing
concierge_configs		concierge_configs
config		config
configs		configs
content		content
data		data
database		database
deploy		deploy
deployment		deployment
docker/production		docker/production
docs		docs
frontend		frontend
ghl_integration		ghl_integration
ghl_real_estate_ai		ghl_real_estate_ai
grafana		grafana
infrastructure		infrastructure
insight_engine		insight_engine
k8s		k8s
mcp-server-toolkit		mcp-server-toolkit
mcp-servers		mcp-servers
mcp_servers		mcp_servers
models		models
modules		modules
monitoring		monitoring
nginx		nginx
observability		observability
packages		packages
portal_api		portal_api
rag-as-a-service		rag-as-a-service
rag_chatbot_demo		rag_chatbot_demo
reports		reports
research		research
scripts		scripts
security		security
shared-schemas		shared-schemas
shared		shared
src		src
streamlit_cloud		streamlit_cloud
tests		tests
utils		utils
voice-ai-platform		voice-ai-platform
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.railwayignore		.railwayignore
.semgrep.yml		.semgrep.yml
.supermemory_state_2026_02_05.json		.supermemory_state_2026_02_05.json
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
AUDIT_MANIFEST.md		AUDIT_MANIFEST.md
BENCHMARKS.md		BENCHMARKS.md
BENCHMARK_VALIDATION_REPORT.md		BENCHMARK_VALIDATION_REPORT.md
CASE_STUDY.md		CASE_STUDY.md
CHANGELOG.md		CHANGELOG.md
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
Dockerfile		Dockerfile
GEMINI.md		GEMINI.md
LICENSE		LICENSE
METRICS_CANONICAL.md		METRICS_CANONICAL.md
Makefile		Makefile
PINNED_FOR_REVIEW.md		PINNED_FOR_REVIEW.md
README.md		README.md
SECURITY.md		SECURITY.md
alembic.ini		alembic.ini
conftest.py		conftest.py
docker-compose.observability.yml		docker-compose.observability.yml
docker-compose.production.yml		docker-compose.production.yml
docker-compose.yml		docker-compose.yml
pyproject.toml		pyproject.toml
pyrightconfig.json		pyrightconfig.json
pytest.ini		pytest.ini
render.yaml		render.yaml
requirements-dev.txt		requirements-dev.txt
requirements-ml.txt		requirements-ml.txt
requirements-observability.txt		requirements-observability.txt
requirements.txt		requirements.txt
setup.sh		setup.sh
task.md		task.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

EnterpriseHub

Executive Summary

Business Impact

Architecture

Live Demo

Architecture Tour: 5 Systems Worth Reviewing

1. Agent Mesh Coordinator

2. 3-Tier LLM Cache

3. Compliance Response Pipeline

4. Cross-Bot Handoff with Performance Routing

5. A/B Testing Service

Screenshots (Live Demo)

Tech Stack

Security

Architecture Decisions

Project Structure

Deployment

Monitoring

Related Projects

Contributing

License

About

Uh oh!

Releases

Sponsor this project

Uh oh!

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

EnterpriseHub

Executive Summary

Business Impact

Architecture

Live Demo

Architecture Tour: 5 Systems Worth Reviewing

1. Agent Mesh Coordinator

2. 3-Tier LLM Cache

3. Compliance Response Pipeline

4. Cross-Bot Handoff with Performance Routing

5. A/B Testing Service

Screenshots (Live Demo)

Tech Stack

Security

Architecture Decisions

Project Structure

Deployment

Monitoring

Related Projects

Contributing

License

About

Topics

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Sponsor this project

Uh oh!

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages