Interests: Databases, Distributed Systems and Information Retrievel.
Currently exploring Hadoop and eco-system.
Spot Award (2011-12 Quarter I), Star Performer (2012-13 Querter II), Three Team awards (2011 ,2012)
Skillset: {hadoop, Sqoop, Core Java, C/C++(Win,Linux), SQL/PL-SQL}
Release Manager, Apache Sqoop 1.4.2
Designed and implemented secondary crawler as a web service for crawling pages that change more frequently. Also worked on Extracting Info. from docs using SystemT.
Leveraged Facebook API to crawl data, dump it to HDFS and later analyze it using JAQL, a Hadoop based querying language.
Extended Apache Sqoop w.r.t. Customer’s database and data warehouse needs.
Used Hive Thrift C++ Client and Server to implement ODBC driver for Apache Hive. Currently mentoring a team of seven as well as implementing ODBC Driver for a cloud-based warehouse.
Designed and implemented queries and Map Reduce scripts in Hive for Microsoft excel add-in; currently mentoring two professionals on this project.
Designed and implemented the algorithms for auto-robots.
Created a social portal for college students in PHP and HTML.
The aim of the project was to detect GoF design patterns used in a given Java code. It involved parsing the Java files, analyzing them using an AST (Abstract Syntax Tree), detecting patterns by means of predefined rules and showing detailed reports on the patterns used. Forming rules required extensive study of the design patterns and existing solutions. I contributed by creating the rules and designing implementation and logical structure of the database.