Using Apache Spark and Neo4j for Big Data Graph Analytics

Blog post from Neo4j

Post Details
Company: Neo4j
Author: Kenny Bastani
Word Count: 1,356
Language: English
Summary

Mazerunner is an unmanaged extension for Neo4j that runs big data graph processing jobs and persists the results back to Neo4j. It uses a message broker to distribute graph processing jobs to Apache Spark's GraphX module and stores the job results in HDFS. Offloading the computation to Spark makes the analysis of large datasets scalable and fast. In one reported run, an analysis of a Wikipedia dump, which produced a graph with over 10 million nodes and 104 million relationships, completed in under 3 hours on a laptop.
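
The job flow described in the summary can be illustrated with a minimal, self-contained Python sketch. It is not Mazerunner's code: `queue.Queue` stands in for the message broker, the `file_store` dict stands in for HDFS, the `graph_properties` dict stands in for node properties in Neo4j, and a plain power-iteration PageRank stands in for a GraphX job. The choice of PageRank as the analysis, along with every name, path, and field in the sketch, is an assumption made for illustration.

```python
from queue import Queue


def pagerank(edges, damping=0.85, iterations=20):
    """Power-iteration PageRank over a list of (source, target) pairs."""
    nodes = sorted({node for edge in edges for node in edge})
    out_links = {node: [] for node in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    count = len(nodes)
    rank = {node: 1.0 / count for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes without outgoing edges is spread evenly.
        dangling = sum(rank[n] for n in nodes if not out_links[n])
        base = (1.0 - damping) / count + damping * dangling / count
        new_rank = {node: base for node in nodes}
        for src, targets in out_links.items():
            if not targets:
                continue
            share = damping * rank[src] / len(targets)
            for dst in targets:
                new_rank[dst] += share
        rank = new_rank
    return rank


def run_job(job, file_store, graph_properties):
    """Process one job: read the exported edges, analyze, write results back."""
    edges = file_store[job["input_path"]]
    scores = pagerank(edges)
    # Results land in the file store first (HDFS in the real system).
    file_store[job["output_path"]] = scores
    # Then they are applied to the graph as a node property.
    for node, score in scores.items():
        graph_properties.setdefault(node, {})[job["property"]] = score


# Hypothetical stand-ins for the broker, the file system, and the graph.
jobs = Queue()
file_store = {
    "/jobs/1/edges": [("a", "b"), ("b", "c"), ("c", "a"), ("d", "a")],
}
graph_properties = {}

jobs.put({
    "input_path": "/jobs/1/edges",
    "output_path": "/jobs/1/results",
    "property": "pagerank",
})

while not jobs.empty():
    run_job(jobs.get(), file_store, graph_properties)

for node in sorted(graph_properties):
    print(node, round(graph_properties[node]["pagerank"], 4))
```

Separating the queue from the worker function mirrors the decoupling the summary describes: the component that submits a job only needs to know where the exported edges are and where the results should go, not how the analysis is computed.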