Skip to content

Conversation

@Lingkai981
Copy link

What do these changes do?

This PR contributes the open-sourced artifacts of our SIGMOD 2025 paper Revisiting Graph Analytics Benchmark.

Specifically, it adds:

  • Failure-Free Trial Data Generator (FFT-DG) for generating benchmark datasets with controlled scale, density, and diameter.
  • LLM-based API usability evaluation framework, packaged with Docker, enabling reproducible usability studies without costly human evaluation.
  • Performance evaluation scripts and assets for running benchmarks on multiple graph platforms (GraphX, PowerGraph, Flash, Grape, Pregel+, Ligra, G-thinker) in Kubernetes + Docker environments.

These artifacts ensure reproducibility of the paper’s results and provide the community with a standardized, extensible benchmark suite for cross-platform graph analytics.

Related issue number

N/A

Fixes

@CLAassistant
Copy link

CLAassistant commented Sep 8, 2025

CLA assistant check
All committers have signed the CLA.

@longbinlai longbinlai changed the title Graph-Analytics-Benchmarks featGraph-Analytics-Benchmarks Sep 8, 2025
@longbinlai longbinlai changed the title featGraph-Analytics-Benchmarks feat(analytics): Graph-Analytics-Benchmarks Sep 8, 2025
@longbinlai longbinlai requested a review from Copilot September 8, 2025 03:24
Copy link

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR introduces the open-sourced artifacts from the SIGMOD 2025 paper "Revisiting Graph Analytics Benchmark", providing a comprehensive benchmarking framework for graph analytics platforms. The framework includes three main components: a Failure-Free Trial Data Generator (FFT-DG), an LLM-based API usability evaluation framework, and performance evaluation scripts for multiple graph platforms.

  • Adds comprehensive documentation and setup instructions for benchmarking seven graph platforms (Flash, Ligra, Grape, Pregel+, Gthinker, PowerGraph, GraphX)
  • Provides execution scripts and Kubernetes YAML templates for distributed performance testing using MPI operators
  • Includes a C++ data generator with configurable scale, density, and diameter parameters

Reviewed Changes

Copilot reviewed 23 out of 23 changed files in this pull request and generated 13 comments.

Show a summary per file
File Description
analytical_engine/benchmarks/framework/README.md Main documentation explaining the benchmark suite, supported platforms, and usage instructions
analytical_engine/benchmarks/framework/*/run.sh Execution scripts for each platform's benchmarking workflow
analytical_engine/benchmarks/framework//.yaml Kubernetes MPIJob templates for distributed testing
analytical_engine/benchmarks/framework/Graphx/*.sh Spark-based execution scripts for GraphX platform
analytical_engine/benchmarks/framework/Data_Generator/* C++ data generator implementation and configuration files

Tip: Customize your code reviews with copilot-instructions.md. Create the file or learn how to get started.

@github-actions github-actions bot added the stale label Oct 12, 2025
@github-actions
Copy link
Contributor

/cc @yecol @sighingnow, this issus/pr has had no activity for a long time, please help to review the status and assign people to work on it.

…aml to analytical_engine/benchmarks/framework/config/flash-mpijob-template.yaml
…aml to analytical_engine/benchmarks/framework/config/grape-mpijob-template.yaml
…e.yaml to analytical_engine/benchmarks/framework/config/gthinker-mpijob-template.yaml
…aml to analytical_engine/benchmarks/framework/config/ligra-mpijob-template.yaml
…ate.yaml to analytical_engine/benchmarks/framework/config/powergraph-mpijob-template.yaml
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants