Skip to content

The Graph dataFrame Query Language: The open source embeddable graph wrangling with native in-memory large-scale columnar computing. See pygaphistry OSS repo.

License

Notifications You must be signed in to change notification settings

graphistry/gfql

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

GFQL

Note: This repository is a placeholder. For the full GFQL implementation, documentation, and examples, visit the PyGraphistry GitHub repository and GFQL documentation. See the SQL/Cypher/GFQL comparison guide to understand how GFQL relates to familiar query languages.

What is GFQL?

GFQL (Graph dataFrame Query Language) is an open-source embeddable graph query language for data scientists, analysts, and developers working with graph data. It combines the expressiveness of graph analytics with the performance of modern dataframe operations, enabling native in-memory large-scale columnar computing on graph structures. GFQL makes it easy to wrangle, transform, and analyze graph data using familiar dataframe-style operations while leveraging GPU acceleration for processing graphs with billions of edges.

GFQL enables graph wrangling without requiring a graph database. You can separate your storage tier (SQL databases, files, data lakes, or graph databases) from how your application or cluster handles graph operations like shaping, cleaning, pattern searching, algorithmic enrichments, and visualization. Work directly with your existing data infrastructure.

GFQL is trusted by banks, startups, security teams, and every Graphistry user for mission-critical graph analytics and investigation workflows.

Top Features

  • Embeddable Graph Queries: Write expressive graph traversals and pattern matching queries that can be embedded directly in your Python workflows

  • Dataframe-Native Operations: Leverage familiar dataframe APIs (Pandas, cuDF, Apache Arrow) for graph computations with seamless integration. Apache Arrow provides faster & safer data that works with most DBs and data platforms

  • GPU-Accelerated Performance: Process massive graphs with billions of edges using GPU acceleration for unprecedented speed. Part of the NVIDIA Rapids ecosystem for vectorized GPU columnar engine mode on GPUs

  • Columnar Computing: Efficient in-memory columnar storage and operations optimized for modern analytics workloads. Vectorized CPU columnar engine mode on CPUs via Pandas and GPU mode via NVIDIA Rapids

  • Open Source & Extensible: Fully open source with extensible architecture for custom graph operations and integrations

  • Rich Connector Ecosystem: Seamlessly integrate with your existing data infrastructure without requiring a graph database:

    Category Connector Tutorials
    Data Platforms, SQL & Logs
    Graph Databases
    Python Tools & Libraries

    View all connectors →

Next Steps

About

The Graph dataFrame Query Language: The open source embeddable graph wrangling with native in-memory large-scale columnar computing. See pygaphistry OSS repo.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •