GitPedia

Leo cdp free edition

The binary build of LEO CDP Free Edition for training purposes

From trieu·Updated November 11, 2025·View on GitHub·

**LEO CDP** is an **Open Source AI-first Customer Data Platform (CDP)** framework that empowers organizations to build and operate their own fully customizable CDP infrastructure — with **machine learning and big data** at its core. The project is written primarily in HTML, distributed under the Apache License 2.0 license, first published in 2021. Key topics include: arangodb, big-data, big-data-analytics, cdp, customer-analytics.

<div style="background-color: #F0F8FF; text-align:center; border-radius:8px;"> <img src="https://gcore.jsdelivr.net/gh/USPA-Technology/leo-cdp-static-files@latest//images/leo-cdp-logo.png" alt="LEO CDP framework" style="width:640px;margin:auto;"/> </div>

LEO CDP – The Open Source AI-first Customer Data Platform

LEO CDP is an Open Source AI-first Customer Data Platform (CDP) framework that empowers organizations to build and operate their own fully customizable CDP infrastructure — with machine learning and big data at its core.

Designed for developers, data scientists, marketers, and enterprises, LEO CDP enables unified data collection, real-time customer analytics, audience segmentation, and personalized marketing — all while remaining self-hosted and privacy-friendly.


🚀 Vision & Philosophy

  • The philosophy of Dataism → USPA → LEO CDP
  • Democratize AI-powered data platforms for digital transformation
  • Promote data sovereignty, on-premise intelligence, and open collaboration

🔥 Key Features

  1. Omnichannel Data Collection & Unification
    Collect data from web, mobile, CRM, IoT, POS, social media, and APIs. Unify into rich customer profiles.

  2. Real-Time Customer 360
    Build a complete view of every customer using behavioral, transactional, and third-party data.

  3. AI-based Segmentation & Scoring
    Use clustering, RFM, CLV prediction, churn scoring, and dynamic audiences using ML models.

  4. Behavioral Tracking & Journey Mapping
    Track individual actions and interactions in real time. Map customer journeys across channels.

  5. Predictive Analytics & Insights
    Leverage machine learning pipelines with Jupyter/Colab for real-time insights.

  6. Personalization & Activation
    Using Agentic AI to deliver personalized experiences via email, push, SMS, and content based on customer intent.

  7. Event-Driven Architecture with ETL/ELT
    Built-in Apache Airflow integration to manage data ingestion, transformation, and orchestration.

  8. Plug-in Ecosystem & API-First Design
    Easy to extend, integrate, and automate via REST APIs and modular services.

  9. Data Governance & Privacy
    Built-in consent tracking, GDPR compliance, and on-prem hosting for full control over customer data.

  10. DevOps Ready
    Docker-based deployment, Prometheus + Grafana monitoring, scalable microservice architecture.


🌍 Why Open Source?

  • Break away from SaaS lock-in. Full customization and ownership of your CDP.
  • Ideal for agencies, startups, enterprises, and researchers building AI-powered marketing stacks.
  • Open source encourages transparency, innovation, and community-driven evolution.

📈 Roadmap 2025+

FeatureStatus
✅ Core CDP Platform (Profiles, Events, Segmentation)Complete
✅ CDP SDKs (JavaScript, Python)Complete
🔄 Identity Resolution with Graph + Vector MatchingIn Progress
🔄 AI Assistant (Chatbot for Audience Insights & Suggestions)In Progress
🔄 Agentic AI: Personalizing the Customer ExperienceIn Progress
🔄 Embedding Model for Customer Vector Search (via Qdrant)In Progress
🆕 CDP Mobile SDKs (Android, iOS, React Native)Planned
🆕 Open Source Campaign Management UIPlanned
🆕 Integration Marketplace for Martech ToolsPlanned
🆕 Webhook + Event Bus Support (Kafka / RabbitMQ / SQS)Planned
🆕 Federated Identity Graph using OpenID & OAuthPlanned

Want to contribute? Join the community!


🧪 System Demo


📚 Documents


🛠️ Tech Stack

  • Backend: Java 11 (Amazon Corretto), Python 3.10
  • Database: ArangoDB 3.11 (Multi-model: Document + Graph + Search)
  • Monitoring: Prometheus 2 + Grafana 8
  • Data Pipeline: Apache Airflow
  • Analytics & ML: Jupyter Notebook / Google Colab
  • Messaging: Redis 6, OneSignal, Firebase
  • Deployment: Ubuntu 22 LTS, Docker, On-Prem / Cloud

☁️ Cloud Options

  • Google Cloud, AWS, VNG Cloud, or your own private infrastructure

🔧 Installation

See: Installation Guide


🧑‍💻 Author & License

Created by: Trieu Nguyen (Thomas)
License: Open Source - MIT-style.
Use freely. Customize. Brand your own white-label CDP. Just respect the original creator 🙏.


💬 Community & Support


📜 Historical Proof of Innovation

Contributors

Showing top 1 contributor by commit count.

View all contributors on GitHub →

This article is auto-generated from trieu/leo-cdp-free-edition via the GitHub API.Last fetched: 6/25/2026