Neo4j
Neo4j is the world's leading graph database management system, developed by Neo4j, Inc. It uses a native property graph model with nodes, relationships (edges), and properties to store and query highly connected data. Neo4j provides ACID-compliant transactions, native graph storage, and the Cypher declarative query language. As of early 2026, Neo4j has surpassed $200 million in annual revenue and serves as the most widely deployed graph database globally. 1)
History
Neo4j's origins trace back to 2000, when founders Peter Neubauer and Johan Svensson encountered limitations with relational databases while building a media management system in Sweden. They modeled data as interconnected networks rather than tables, which improved flexibility despite initial performance challenges. 2)
Johan Svensson implemented a native graph storage engine, and the intellectual property was spun out into Neo4j, Inc. in 2007.
Key milestones:
2000 – Property graph model conceived for media management
2007 – Neo4j, Inc. founded as a standalone company
2009 – XPath-like query syntax developed (precursor to Cypher and Gremlin)
2010 – Term “Property Graph” coined by employee Marko Rodriguez
2011 – Neo4j 1.4 released publicly; Cypher v1 introduced (read-only)
2012 – Neo4j 1.8 added Cypher modifications
2013 – Neo4j 2.0 introduced labels and indexes
2015 – openCypher project launched for broader Cypher adoption
2016 – Neo4j 3.0 released
2018 – Enterprise Edition core closed-source from version 3.5
2021 – $325 million Series F at $2 billion+ valuation
2025 – Revenue surpassed $200 million; IPO preparations began
3)
Cypher Query Language
Cypher is Neo4j's declarative graph query language, inspired by SQL's principle of separating “what” from “how.” It uses an intuitive ASCII-art syntax for pattern matching, with arrows and brackets representing relationships:
MATCH (p:Person)-[:KNOWS]->(f:Person)
WHERE p.name = 'Alice\n RETURN f.name
Cypher was first introduced in 2011 as a read-only language and has evolved through multiple versions to support full CRUD operations, aggregations, subqueries, and graph algorithms. 4)
In 2015, Neo4j launched the openCypher initiative, making Cypher available as an open standard adopted by other graph databases including Memgraph, Amazon Neptune, and SAP HANA Graph.
Use Cases
Neo4j excels in scenarios involving highly connected data:
Knowledge Graphs – modeling interconnected entities for semantic search, AI/ML feature engineering, and enterprise data integration. Neo4j has become a foundational technology for GenAI applications, backing LLMs with structured knowledge graphs for retrieval-augmented generation (RAG).
Fraud Detection – traversing relationship patterns in financial networks to identify anomalous connections, money laundering rings, and suspicious transaction chains. Major banks and financial institutions use Neo4j for real-time fraud analysis.
Recommendation Engines – analyzing user-item-category connections for personalized suggestions, leveraging native graph traversal for collaborative and content-based filtering.
Network and IT Operations – mapping infrastructure dependencies, root cause analysis, and impact assessment.
Identity and Access Management – modeling complex permission hierarchies and organizational structures.
Supply Chain Management – tracking goods, suppliers, and logistics relationships.
Neo4j Aura
Neo4j Aura is the fully managed cloud database-as-a-service offering, available on AWS, GCP, and Azure. 5)
AuraDB Free – limited-size graph database for learning and prototyping
AuraDB Professional – auto-scaling production databases with pay-per-use pricing
AuraDB Enterprise – dedicated infrastructure with advanced security, SSO, VPC peering, and SLA guarantees
Aura Graph Analytics – fully managed graph analytics as a service
Aura Agent – a unified console for managing database instances
Aura eliminates the operational overhead of database management, backups, and scaling while providing the full Neo4j feature set.
Graph Data Science Library
The Neo4j Graph Data Science (GDS) library provides a comprehensive suite of graph algorithms for analytics and machine learning:
Centrality algorithms – PageRank, Betweenness Centrality, Degree Centrality
Community detection – Louvain, Label Propagation, Weakly Connected Components
Pathfinding – Dijkstra, A*, Breadth-First Search, Minimum Spanning Tree
Similarity – Node Similarity, K-Nearest Neighbors
Link prediction – predicting future connections in graphs
Graph embeddings – Node2Vec, FastRP for machine learning feature generation
ML pipelines – end-to-end machine learning workflows within the graph
GDS is available both as a self-managed library and through Aura Graph Analytics. It also integrates with Snowflake AI Data Cloud and Microsoft Fabric through dedicated partnerships.
Funding and Company
Neo4j has raised over $580 million in total funding:
Series E – $80 million
Series F – $325 million (June 2021), led by Eurazeo and GV, the largest single investment round in database history at that time ((Source: [[https://neo4j.com/news/neo4j-announces-325-million-series-f-investment-the-largest-in-database-history/|Neo4j - $325M Series F]]))
Valued at over $2 billion following the Series F
IPO preparations reportedly began in late 2024, targeting a Nasdaq listing
The company is headquartered in San Mateo, California, with offices worldwide. Neo4j's Community Edition is available under a modified GPL license, while the Enterprise Edition requires a commercial license.
See Also
References