This tutorial introduces the JSONiq language, which declaratively manipulates JSON data.
It was moved to an interactive, live tutorial that can be found here.
A user-friendly display of the tutorial is located here, if you just want to take a look with no setup: just ignore the instructions and scroll your way down.
You can run JSONiq queries in Python with our jsoniq package (Java 17 or 21 required with JAVA_HOME correctly set) with this simple install command:
pip install jsoniq
Hello world query:
from jsoniq import RumbleSession
rumble = RumbleSession.builder.getOrCreate();
print(rumble.jsoniq('{ "foo": [ 6*7 ] }').json());
The jsoniq package contains RumbleDB running on top of pyspark. It can be used on your laptop for small or medium amounts of data, or on large clusters on Petabytes of data.
You can also install RumbleDB to run on the command line on your laptop or on a large cluster (Amazon EMR, Azure HDInsight, Google Cloud...). The documentation is here.
The queries in the notebook also can be run on a local installation of Zorba. However, Zorba is no longer officially maintained.