Role Overview
We are looking for a Senior Data Analyst who can turn complex, multi-source data into clear, decision-driving insight across blockchain gaming, speech AI benchmarking, and platform analytics. You will work closely with the Head of Data to interrogate the KGen data lake, produce benchmark analyses for IndicBench, build Superset dashboards, and support KAI (our NL2SQL tool) with robust SQL logic and schema documentation. This is a high-ownership role — we expect you to define metrics, question assumptions, and communicate findings with precision.
What You Will Work On
- Blockchain and user engagement analytics — query KGen's Athena data lake to produce player behaviour insights, token flow analysis, and game performance reports; work with on-chain data across Aptos, BSC, Ethereum, and Polygon
- Fraud and anomaly detection — support investigation of suspicious wallet activity, sequential email pattern analysis, and device-based fraud signals using Dune Analytics and internal pipelines
- POE (User Profiling Engine) — maintain and extend KGen's primary data product: user segmentation, retention cohorts, engagement scoring, and cross-game behaviour profiling
- Benchmark metrics and reporting — analyse ASR benchmark results across 16 languages and 14 providers; interpret WER, CER, BERTScore, PIER, DER, and code-switching metrics; produce internal QA reports and publication-ready benchmark summaries
- KAI support — validate NL2SQL outputs from our GPT-4-powered analytics tool against ground-truth SQL; document the schema, contribute FQA (Frequently Queried Analytics) examples, and maintain the Pinecone retrieval index
- Superset dashboard ownership — build, maintain, and iterate on dashboards for KGen leadership; resolve permission and connectivity issues; train team members on self-serve analytics
- Data quality monitoring — define and track data quality KPIs across ingestion pipelines; flag anomalies and escalate to engineering
- Stakeholder reporting — translate analytical findings into concise, credible outputs for founders, investors, and external partners; support LinkedIn research posts and benchmark publication materials
You Should Have
- 4+ years in an analytical role with ownership of reporting and insight delivery
- Advanced SQL — window functions, CTEs, complex joins; comfortable writing production-quality queries against Athena or equivalent
- Experience analysing on-chain or financial transaction data (blockchain familiarity is strongly preferred)
- Proficiency with BI tooling — Superset, Metabase, Looker, or equivalent; able to build dashboards from scratch, not just edit existing ones
- Ability to work with Python for data wrangling (pandas, analysis notebooks) — you do not need to be an engineer, but you should be self-sufficient
- Experience with NLP evaluation metrics or ML model benchmarking is a strong plus
- Exceptional written communication — you can write a benchmark report that is technically precise and externally publishable
- Comfort with ambiguity — our data landscape spans gaming, blockchain, and speech AI; intellectual range matters