  • Doctor of Philosophy (Ph.D.) Computer Science   [ Fall 2018 - present ]
    The University of Texas at Dallas, TX, USA
    Advisor: Prof. Sriraam Natarajan
  • Master of Science (M.S.) Computer Science   [ Fall 2017 - present ]
    The University of Texas at Dallas, TX, USA
  • Bachelor of Technology (B.Tech.) Information and Communication Technology   [ May 2013 ]
    Dhirubhai Ambani Institute of ICT (DA-IICT), Gandhinagar, India
    Thesis Advisor: Prasenjit Majumder
    Thesis Topic: Language identification for short text in transliterated space


  • A Unified Framework for Knowledge Intensive Gradient Boosting: Leveraging Human Experts for Noisy Sparse Domains,
    Harsha Kokel, Phillip Odom, Shuo Yang, Sriraam Natarajan, In AAAI 2020.
    paper | blog | code | supplemental | DOI | poster | slide
  • Morpheme Extraction Task,
    Rashmi Sankepally, Harsha Kokel, Komal Agarwal, Prasenjit Majumder, In FIRE 2013.
    paper | DOI



  • ML Intern, Turvo Inc., CA, USA   [ Summer 2018 ]
    Modeled a cost estimator that leverages the knowledge of domain experts. This work motivated Kokel et al., AAAI 2020.
  • Senior Software Engineer, Amadeus Software Labs, Bangalore, India   [ 2016-17 ]
    Implemented an efficient low-fare search over extended time periods for Air Canada.
  • Associate Technology, Publicis Sapient, Bangalore, India   [ 2013-16 ]
    Provided content management solutions for enhanced digital presence.


  • Research Assistant, UT Dallas, TX, USA   [ Fall 2018 ]
    Working on DARPA's Communicating with Computers grant.
  • Teaching Assistant, UT Dallas, TX, USA   [ Fall 2018 ]
    CS6343 and CS4365, the graduate- and undergraduate-level Artificial Intelligence classes.
  • Research Assistant, DA-IICT, Gandhinagar, India   [ 2012-13 ]
    Worked on Sandhan, a multilingual search engine for 8 Indian languages, including cross-lingual search. Developed the Query Builder for Indian languages, with query expansion for relevance judgment.

Technical skills

    Python, Java, C, Shell Scripting, MATLAB, R, Linux/Unix, Git, SQL, Prolog, PDDL, Jupyter.

Selected Projects

Communicating with Computers

A DARPA-funded project to build an intelligent Minecraft agent that can communicate and collaborate with humans through chat to build structures. [ video ]

Knowledge Intensive Gradient Boosting

Leveraging qualitative domain knowledge while learning tree-based gradient boosting models, to improve predictions in regions where data is noisy or absent. [ code ]

Learning Sparse Graph for GNN

Used meta-learning techniques to optimize the graph structure and obtain a sparse graph for GNNs.

Causal inference from Protein Expression Data

Discovering causal molecular relationships from the evaluation of observational data using do-calculus. [ details ]


Developed an interface that allows users with a basic understanding of ER diagrams to provide search bias for Inductive Logic Programming, as described in Hayes et al. 2017. [ code ]

Expression Detection

A small project to detect wink and shush expressions using OpenCV. [ code ]

RL for Healthcare

Learning policies for the management of children on ECMO using batch reinforcement learning techniques. [ details ]

SRL model for credit default

Learned and evaluated a statistical relational model for the Kaggle Home Credit Default Risk dataset and compared it with propositional models.

ML/AI basket

My basket of Machine Learning and AI algorithms implemented over time. [ code ]

Academic Service

  • Student volunteer at ICDE 2020.
  • Assistant Electronic Publishing Editor for JAIR, 2020 - present.
  • Reviewed papers for CODS-COMAD, 2020 and SDM, 2020.
  • Helped organize meetings of the Forum for Information Retrieval Evaluation (FIRE), 2018 and 2013.
  • Conducted a lab on Information Retrieval at the Microsoft Research India & IRSI Pre-FIRE workshop, 2013.
  • Co-organized Morpheme Extraction Task at FIRE 2013.
