GRASP Project

This is a implementation of the paper: Data-Agnostic Cardinality Learning from Imperfect Workloads.

This repo contains:

🪐 A simplified PyTorch implementation of GRASP, containing core functionalities of the GRASP system.
⚡️ A PyTorch implementation of ArCDF, improving on prior work NeuroCDF.
🛸 A self-contained Python file for reproducing the main experiments on CEB-IMDb-full.
🛸 A self-contained Python file for reproducing the main experiments on DSB.
🎉A Python script for running the query end-to-end experiments.

Preparation

Dataset/Workloads

Download CEB-IMDb-full (i.e., CEB-IMDb-13k) benchmark, and place the entire directory in your IMDB_DIRECTORY in train_grasp_ceb.py .
The DSB workload is contained in this file.

Query Optimization

Please download and install the modified PostgreSQL from here.
Download the IMDb dataset from here, and download the populated DSB dataset used in the paper from here.
Please load the data into PostgreSQL.

Usage

Training GRASP over CEB-IMDb-full

To train the GRASP model over CEB-IMDb-full, run the following command:

python train_grasp_ceb.py

Training GRASP over DSB

To train the GRASP model over DSB, run the following command:

python train_grasp_dsb.py

Configuration

The training scripts can be configured by modifying the parameters in the respective train_grasp_*.py files. Key parameters include:

epoch: Number of training epochs
feature_dim: Dimension of CardEst models
lcs_dim: dimension of the Learned Count Sketch Models
bs: Batch size
lr: Learning rate

Utilities

The project includes various utility functions and classes located in the CEB_utlities and dsb_utlities directories. These utilities are used for data/workloads processing.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

If you have any questions, feel free to contact me through email ([email protected]).

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
CEB_utlities		CEB_utlities
GRASP		GRASP
arcdf		arcdf
dsb_utlities		dsb_utlities
queries		queries
GRASP_Camera_Ready.pdf		GRASP_Camera_Ready.pdf
LICENSE		LICENSE
README.md		README.md
grasp.png		grasp.png
overview.png		overview.png
query_optimization.py		query_optimization.py
train_grasp_ceb.py		train_grasp_ceb.py
train_grasp_dsb.py		train_grasp_dsb.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

GRASP Project

Preparation

Dataset/Workloads

Query Optimization

Usage

Training GRASP over CEB-IMDb-full

Training GRASP over DSB

Configuration

Utilities

License

Contact

About

Uh oh!

Releases

Packages

Languages

License

shoupzwu/GRASP

Folders and files

Latest commit

History

Repository files navigation

GRASP Project

Preparation

Dataset/Workloads

Query Optimization

Usage

Training GRASP over CEB-IMDb-full

Training GRASP over DSB

Configuration

Utilities

License

Contact

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages