Note: This repository is a placeholder. For the full GFQL implementation, documentation, and examples, visit the PyGraphistry GitHub repository and GFQL documentation. See the SQL/Cypher/GFQL comparison guide to understand how GFQL relates to familiar query languages.
GFQL (Graph dataFrame Query Language) is an open-source embeddable graph query language for data scientists, analysts, and developers working with graph data. It combines the expressiveness of graph analytics with the performance of modern dataframe operations, enabling native in-memory large-scale columnar computing on graph structures. GFQL makes it easy to wrangle, transform, and analyze graph data using familiar dataframe-style operations while leveraging GPU acceleration for processing graphs with billions of edges.
GFQL enables graph wrangling without requiring a graph database. You can separate your storage tier (SQL databases, files, data lakes, or graph databases) from how your application or cluster handles graph operations like shaping, cleaning, pattern searching, algorithmic enrichments, and visualization. Work directly with your existing data infrastructure.
GFQL is trusted by banks, startups, security teams, and every Graphistry user for mission-critical graph analytics and investigation workflows.
-
Embeddable Graph Queries: Write expressive graph traversals and pattern matching queries that can be embedded directly in your Python workflows
-
Dataframe-Native Operations: Leverage familiar dataframe APIs (Pandas, cuDF, Apache Arrow) for graph computations with seamless integration. Apache Arrow provides faster & safer data that works with most DBs and data platforms
-
GPU-Accelerated Performance: Process massive graphs with billions of edges using GPU acceleration for unprecedented speed. Part of the NVIDIA Rapids ecosystem for vectorized GPU columnar engine mode on GPUs
-
Columnar Computing: Efficient in-memory columnar storage and operations optimized for modern analytics workloads. Vectorized CPU columnar engine mode on CPUs via Pandas and GPU mode via NVIDIA Rapids
-
Open Source & Extensible: Fully open source with extensible architecture for custom graph operations and integrations
-
Rich Connector Ecosystem: Seamlessly integrate with your existing data infrastructure without requiring a graph database:
Category Connector Tutorials Data Platforms, SQL & Logs Graph Databases Python Tools & Libraries
- SQL/Cypher/GFQL Translation Guide - Compare GFQL with SQL and Cypher query languages
- GFQL Documentation - Complete guide in the PyGraphistry Read the Docs
- PyGraphistry GitHub Repository - Main project repository with GFQL implementation
- 2B Edge Graph GPU Result - 2025 LinkedIn post about GFQL's 2 billion edge graph GPU performance
- What is Graph Intelligence? - 2022 article by Ben Lorica at Gradient Flow on graph intelligence in the compute tier