- 
                Notifications
    You must be signed in to change notification settings 
- Fork 458
feat(analytics): Graph-Analytics-Benchmarks #4625
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Pull Request Overview
This PR introduces the open-sourced artifacts from the SIGMOD 2025 paper "Revisiting Graph Analytics Benchmark", providing a comprehensive benchmarking framework for graph analytics platforms. The framework includes three main components: a Failure-Free Trial Data Generator (FFT-DG), an LLM-based API usability evaluation framework, and performance evaluation scripts for multiple graph platforms.
- Adds comprehensive documentation and setup instructions for benchmarking seven graph platforms (Flash, Ligra, Grape, Pregel+, Gthinker, PowerGraph, GraphX)
- Provides execution scripts and Kubernetes YAML templates for distributed performance testing using MPI operators
- Includes a C++ data generator with configurable scale, density, and diameter parameters
Reviewed Changes
Copilot reviewed 23 out of 23 changed files in this pull request and generated 13 comments.
Show a summary per file
| File | Description | 
|---|---|
| analytical_engine/benchmarks/framework/README.md | Main documentation explaining the benchmark suite, supported platforms, and usage instructions | 
| analytical_engine/benchmarks/framework/*/run.sh | Execution scripts for each platform's benchmarking workflow | 
| analytical_engine/benchmarks/framework//.yaml | Kubernetes MPIJob templates for distributed testing | 
| analytical_engine/benchmarks/framework/Graphx/*.sh | Spark-based execution scripts for GraphX platform | 
| analytical_engine/benchmarks/framework/Data_Generator/* | C++ data generator implementation and configuration files | 
Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.
        
          
                analytical_engine/benchmarks/framework/PowerGraph/graphlab-mpijob-template.yaml
          
            Show resolved
            Hide resolved
        
      Co-authored-by: Copilot <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Copilot <[email protected]>
Co-authored-by: Copilot <[email protected]>
| /cc @yecol @sighingnow, this issus/pr has had no activity for a long time, please help to review the status and assign people to work on it. | 
…aml to analytical_engine/benchmarks/framework/config/flash-mpijob-template.yaml
…aml to analytical_engine/benchmarks/framework/config/grape-mpijob-template.yaml
…e.yaml to analytical_engine/benchmarks/framework/config/gthinker-mpijob-template.yaml
…aml to analytical_engine/benchmarks/framework/config/ligra-mpijob-template.yaml
…ate.yaml to analytical_engine/benchmarks/framework/config/powergraph-mpijob-template.yaml
What do these changes do?
This PR contributes the open-sourced artifacts of our SIGMOD 2025 paper Revisiting Graph Analytics Benchmark.
Specifically, it adds:
These artifacts ensure reproducibility of the paper’s results and provide the community with a standardized, extensible benchmark suite for cross-platform graph analytics.
Related issue number
N/A
Fixes