Hello, For Immediate basis, I can start right away!
We have set skills to connect your laptop to printer, scanner and install office software.
Please check out my past case studies/jobs:
1) For a Company with innovative medical solutions
Developed MapReduce programs to parse the raw data, populate staging tables and store the refined data in partitioned tables in the EDW. Created Hive queries that helped market analysts spot emerging trends by comparing fresh data with EDW reference tables and historical metrics. Enabled speedy reviews and first mover advantages by using Oozie to automate data loading into the Hadoop Distributed File System and PIG to pre-process the data. Provided design recommendations and thought leadership to sponsors/stakeholders that improved review processes and resolved technical problems. Managed and reviewed Hadoop log files. Tested raw data and executed performance scripts. Shared responsibility for administration of Hadoop, Hive and Pig.
2) For a Company with world-class retail, online and delivery capabilities
Installed and configured MapReduce, HIVE and the HDFS; implemented CDH3 Hadoop cluster on CentOS. Assisted with performance tuning and monitoring. Created HBase tables to load large sets of structured, semi-structured and unstructured data coming from UNIX, NoSQL. Created reports for the BI team using Sqoop to export data into HDFS and Hive. Developed multiple MapReduce jobs in Java for data cleaning and preprocessing.
Regards
Mohit