Data-Toolkit is a Java-based project that provides a set of tools for working with data structures, algorithms, and mathematical concepts. It is designed for educational purposes, prototyping, and reusable component development in the domain of data processing and algorithmic problem solving.
- Vectors : Multi-dimensional vector operations.
- Matrices : Basic linear algebra operations.
- Distances : Euclidean, Manhattan, Minkowski and Levenshtein.
- Points : Representation and manipulation of N-dimensional points.
- Similarities : Jackard similarity and other comparison metrics.
- Binary Search Tree (BST) : Insertion, search, etc.
- Stack : LIFO data structure implementation.
- Queue : FIFO data structure implementation.
- Linked List : Single linked list with standard operations.
- Dijkstra's Algorithm : Shortest path calculation in weighted graphs.
- Agglomerative Hierarchical Clustering Algorithm (AHC / CAH) : Agglomerative clustering algorithm for hierarchical grouping and Dendrogram.
- K-Nearest Neighbors (KNN) : A non-parametric supervised learning method, used for Classification.
Make sure you have Java 8+ and Maven installed on your machine.
-
Clone the repository:
git clone https://github.com/LugolBis/Data-Toolkit.git cd Data-Toolkit
-
Build and run the project:
- Using Maven:
mvn clean install
- Using Maven: