Open-source projects, also available on GitHub.
Implementing and evaluating the effectiveness of methods that determine whether a number is prime: trial division, the Fermat test and the Miller-Rabin test.
Easy-to-use library to determine the similarity between strings or sets of numbers using the Jaccard index, MinHash and Locality-Sensitive Hashing.
WebGL app for three-dimensional visualization of worldwide data, with customizable data-to-visual mapping and filtering with adjustable scales.
Architectural choices behind Vokter v0.2, a multilingual document store with built-in diff detection.
Multilingual parser & indexer that uses Locality-Sensitive Hashing, DiffMatchPatch, Bloom filters and Quartz jobs to detect keywords inserted into or removed from webpages.
Web app that processes N-Triples, N3 and RDF/XML documents and allows users to infer new data using SPARQL queries and to view relationships in GraphViz.
Developing a decision-tree classifier and a data management module to evaluate win/loss probabilities over the course of a Texas Hold'em poker game.
Open-source & cross-platform app that performs file encryption and user authentication on existing cloud storage services.