Neel Shah
Open Source & Projects

Built & Shipped

Research tools, datasets, Python libraries, and mobile apps — all publicly available.

Research Tools & Datasets

Data & AI Projects

📊

Arxiv Data Analysis

Python · Jupyter · 2017

Comprehensive analysis of 24,000+ research papers from arXiv, mapping the emergence and evolution of machine learning, deep learning, and AI topics over time. Full dataset and code included.

Python Data Analysis AI Research NLP
View on GitHub ↗ · ⭐ 4 · 2 forks
🔬

Scopus India Analysis

Dataset · Python · Pinned

Analysis of publications by Indian researchers in Scopus-indexed journals (2000–2016). Explores publication trends, collaboration networks, and research output patterns. Full dataset publicly available.

Research Data Scopus Open Dataset
View on GitHub ↗
📁

arxivData

Dataset · Python

A curated dataset of arXiv papers spanning AI, ML, deep learning, computer vision, and neural networks. Designed as a searchable offline research corpus.

AI ML Deep Learning Computer Vision Open Dataset
View on GitHub ↗

Optimize Python

Python · Guide

A practical guide and code collection for writing high-performance Python — covering profiling, multiprocessing, memory management, and efficient data structures.

Python Performance Best Practices
View on GitHub ↗
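
As an illustration of the profiling and data-structure topics listed above, here is a minimal sketch using only the standard library. It is not taken from the guide itself: the function names `count_hits_list`, `count_hits_set`, and `profile` are hypothetical, and the input sizes are arbitrary.

```python
import cProfile
import io
import pstats


def count_hits_list(needles, haystack):
    # Membership test on a list scans the list for every lookup.
    return sum(1 for n in needles if n in haystack)


def count_hits_set(needles, haystack):
    # Building a set once makes each lookup a hash probe instead of a scan.
    lookup = set(haystack)
    return sum(1 for n in needles if n in lookup)


def profile(func, *args):
    # Run func under cProfile and return (result, text report).
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args)
    stream = io.StringIO()
    stats = pstats.Stats(profiler, stream=stream)
    stats.sort_stats("cumulative").print_stats(5)
    return result, stream.getvalue()


haystack = list(range(2_000))
needles = list(range(0, 4_000, 2))

hits_list, report_list = profile(count_hits_list, needles, haystack)
hits_set, report_set = profile(count_hits_set, needles, haystack)

print(report_list)
print(report_set)
```

Both functions return the same count; comparing the two reports shows where the time goes in each version. Exact timings depend on the machine, so none are quoted here.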
Mobile

Mobile Development

📱

KMP API Test App

Kotlin Multiplatform · 2025

A Kotlin Multiplatform Android application demonstrating API integration with modern state management, built as a foundation for cross-platform mobile development.

Kotlin KMP Android
View on GitHub ↗
Historical Achievement

Open Source Library

😊

emot

Python · MIT · 2018

An early open-source library for detecting and extracting emojis and emoticons from text. It grew organically to over 1 million downloads, a testament to building focused, well-documented tools that solve one problem well.

1M+ downloads
196 stars
365+ dependents
import emot
obj = emot.core.emot()
text = "I love python ☮ 🙂 ❤"
obj.emoji(text)
# → detects, locates, names
# → relied on across 1M+ downloads
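
To make "detects, locates, names" concrete, here is a rough standard-library sketch of the idea. It is not how emot is implemented and does not use emot's API: the helper `find_symbols` is hypothetical, and it only catches single code points in the Unicode "Symbol, other" category, so text emoticons such as `:-)`, flags, and multi-code-point emoji sequences are out of scope.

```python
import unicodedata


def find_symbols(text):
    """Return (character, index, Unicode name) for each 'Symbol, other' character."""
    found = []
    for index, char in enumerate(text):
        # "So" is the Unicode general category "Symbol, other",
        # which covers many (not all) emoji and pictographs.
        if unicodedata.category(char) == "So":
            name = unicodedata.name(char, "UNKNOWN")
            found.append((char, index, name))
    return found


text = "I love python ☮ 🙂 ❤"
symbols = find_symbols(text)

for char, index, name in symbols:
    print(index, char, name)
```

A dedicated library is the better choice in practice, since it maintains curated emoji and emoticon tables rather than relying on a single Unicode category.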
Coming Soon

In Planning

🗄️
Open Data Platform

Aggregating curated open datasets — health, NLP, social media — for researchers and engineers.

In planning · Data Platform
🤖
AI-Ready Datasets

Publishing clean, labelled datasets specifically designed for LLM fine-tuning and RAG applications.

In planning · LLM Data