Hi, I'm Shrainik Jain.

I am a Computer Science PhD graduate from the Paul G. Allen School for Computer Science and Engineering, University of Washington. My research focuses on learning query representations, generalized SQL workload analytics, high-variety scientific data management, and federated databases.

I currently work as a Software Engineer at Snowflake where I fiddle with the SQL compiler and query optimizer.

I am a part of the UW DB group

.. and am currently working with my advisors Bill Howe and Ed Lazowska on generalized SQL workload analytics.

Querc: Generalized SQL Workload analytics

Using learned query representations to enable Workload Management tasks and Machine Learning for Databases.

Learning Query Representations

Exploring NLP techniques for representation learning on SQL to enable downstream workload management applications.

Polystore Data Management

We are trying to build a query compiler and middleware which optimizes queries across multiple backends, figuring out the best system to run subplans on.

Publications and Patents

  • Database-Agnostic Workload Management: Shrainik Jain, Jiaqi Yan, Thierry Cruanes, BillHowe. In proceedings of the 9th biennial Conference on Innovative Data Systems Research (CIDR 2019). (pdf)
  • Query2Vec: An Evaluation of NLP Techniques for Generalized Workload Analytics: Shrainik Jain, Bill Howe, Jiaqi Yan, Thierry Cruanes. In ArXiv e-prints (Jan. 2018) arXiv:cs.DB/1801.05613. (pdf)
  • Snowtrail: Testing with Production Queries on a Cloud Database:, Jiaqi Yan, Qiuye Jin, Shrainik Jain, Stratis D. Viglas, Allison Lee. In proceedings of 7th International Workshopon Testing Database Systems (DBTEST 2018). US Patent Application No.: 62/646,817. (pdf)
  • The Myria Big Data Management and Analytics System and Cloud Service: Jingjing Wang, Tobin Baker, Magdalena Balazinska, Dan Halperin, Brandon Haynes, Bill Howe, Dylan Hutchison, Shrainik Jain, Ryan Maas, Parmita Mehta, Dominik Moritz, Brandon Myers, Jennifer Ortiz, Dan Suciu, Andrew Whitaker, and Shengliang Xu, In proceedings of the 8th biennial Conference on Innovative Data Systems Research (CIDR).
  • Data Cleaning in the Wild: Reusable Curation Idioms from a Multi-Year SQL Workload: Shrainik Jain, Bill Howe, In proceedings of the 11th International Workshop on Quality in DataBases. (draft) (bibtex)
  • SQLShare: Results from a Multi-Year SQL-as-a-Service Experiment: Shrainik Jain, Dominik Moritz, Daniel Halperin, Bill Howe, Ed Lazowska, In proceedings of the 2016 ACM SIGMOD International Conference on Management of Data. (pdf) (bibtex) (Reproducibility Award)
  • High Variety Cloud Databases: Shrainik Jain, Dominik Moritz, Bill Howe, In proceedings of the 2016 IEEE Cloud Data Management Workshop. (pdf) (bibtex)
  • Pattern Formation for Asynchronous Robots without Agreement in Chirality: Sruti Gan Chaudhuri, Swapnil Ghike, Shrainik Jain, Krishnendu Mukhopadhyaya. (pdf) (bibtex)

Here's some stuff I have done in the past.

In no particular order, importance, or chronology.

JavaScript Orleans

Introduced support for dynamic types and JavaScript development to project Orleans during my internship at Microsoft Research, Redmond.

Disaster Recovery as a Service

Also during my stint at Microsoft India Development Center, I was the part of the team responsible for building HyperV FR based Disaster Recovery as a Service or Azure Site Recovery as it is popularly known as.

MOHORO

Was a part of the team responsible for making MOHORO, a scalable, pay-by-usage Desktop-as-a-Service on Azure. Project served as a precursor to Microsoft Azure RemoteApp. This was during my stint as an SDET at Microsoft India Development Center.

Visualizing Satellite Data

As a summer intern at the Indian Institute of Remote Sensing, Indian Space Research Organization (IIRS-ISRO), I built a tool to visualize raw statelite data on maps and 3D plots. Image Source.

More...

As an undergraduate at the Birla Institute of Technology and Science, BITS, Pilani, I spent most of my time being clueless. Apart from being lost, I was a TA, President and Project forum in-charge (mentor) for the Computer Science Association at various times of my undergrad.