VIDEOS TO LEARN ABOUT OUR UNIQUE TRAINING PROCESS:

Facebook Recommendations

Training Details

Course Duration: 40 hours Training + Assignments + Actual Project Based Case Studies

Training Materials: All attendees will receive,

  • Assignment after each module, Video recording of every session
  • Notes and study material for examples covered.
  • Access to the Training Blog & Repository of Materials

Audience & Pre-Requisites:

  • This course is designed for Systems Administrators and IT Managers who have basic Linux experience. No need for prior knowledge of Apache Hadoop.

Advantages of Hadoop online:

The growing importance of Hadoop around the globe has made Hadoop Admin training one of the important topic. It is important that you understand the concept of Hadoop before you start off with your Apache Hadoop online training program. Our course assumes no prior knowledge of Apache Hadoop and Hadoop Administration as our comprehensive suite of online classes and training provide job-oriented Hadoop training for Hadoop Administrators.

Who should plan on joining?

  • Students, DBAs, System Administrators, Software Architects, Data Warehouse Professionals, IT Managers, and Software Developers interested in learning Hadoop Cluster Administration should go for this course.

Training Format:

This course is delivered as a highly interactive session, with extensive live examples. This course is Live Instructor led Online training delivered using Cisco Webex Meeting center Web and Audio Conferencing tool.

Timing: Weekdays and Weekends after work hours.

Course Objective:

This training aims to provide the participants with a comprehensive understanding of all the steps necessary to operate and maintain a Hadoop cluster. From Installation and configuration through load-balancing and tuning.

The participants will learn the complete Installation of Hadoop Cluster, understand the basic and advanced concepts of Map Reduce and the best practices for Apache Hadoop Development as experienced by the developers and architects of core Apache Hadoop.With the help of hands-on exercises, participants will learn the following topics during the course.

  • The internals of MapReduce andHDFS and how to build Hadoop Architecture.
  • Proper cluster configuration and deployment to integrate with systems and hardware in data centre.
  • How to load data into cluster from dynamically-generated files using Flume and from RDBMS using Sqoop.
  • Configuring the FairScheduler to provide service-level agreements for multiple users of a cluster.
  • Installing and implementing Kerberos-based security for your cluster.
  • Best practices for preparing and maintaining Apache Hadoop in production.
  • Troubleshooting, diagnosing, tuning and solving Hadoop issues.

Note: The course will be have 40% of theoretical discussion and 60% of actual hands on

Project Work and Case Study details and Time spent?

  • We will provide case study based on the real-time project, which takes 4 weeks to develop.
  • The specification and guidance will be given on the case study and the participants need to develop and show the result.

Training Highlights

  • Focus on Hands on training
  • 40 hours of Assignments, Live Case Studies
  • Video Recordings of sessions provided
  • One Problem Statement discussed across the whole training program.
  • Introduction to HADOOP and BIG DATA
  • HADOOP Admin Certification Guidance.
  • 100% Practical with Mentor Guidance
  • Resume prep, Interview Questions provided.
  • Covers All Important Hadoop Ecosystem Products

HADOOP-Admin-Training-Roadmap

How are we Different from other Training Institutes?

Role-specific training instead of Product-based training – We are the leaders in providing **Role-Specific training and e-learning solutions for individuals and corporations. Our curriculum are based on real-time job functions as opposed to being product-based. Real-time scenarios and troubleshooting techniques shown in class.
(**Role based training – Here our trainers share their real-time implementation experience in the class. The trainer will work with participant on several Case Studies based on a actual projects. This gives the participant an understanding of how things are accomplished in real-time environment. The idea is to get the participant familiar of the process, real-time.)

Longer Course Durations – We provide students with more detailed training with Assignments based on the real-time scenarios as well as case studies so that the students take away relevant experience in their respective platform.

We offer Training Blogs using Google Site – The Training Blogs are a common platform for both the trainer as well as the trainees to interact with, discuss queries with the trainers, upload assignments and referring assignments. Training Blogs helps the student to attend the sessions anywhere, anytime, using laptop, desktop or tabs/palmtops.

We provide study materials using Google Drive –We provide access to a Repository of materials for training using Google Drive Cloud. The students are given access to their respective modules using Google Drive for which they have access for lifetime and can be accessed anywhere any time.

For our SAP Trainings –We offer the longest duration of Courses in SAP as compared to any other training institute out there. Our SAP training programs are very detailed. Integration with other SAP modules is covered as a part of our training programs.

Never miss a session – We video record every online training sessions and post the Video recording on the training Blog after the session. So if a students misses a Live Online session, the Video is always available on the Blog. Other students can always go back to these video recordings for review purpose or just to go over.

Highly Qualified and Well Experienced Trainers – Our Trainers are highly qualified and are well experienced in their respective domains. We have trainers from USA, Canada, Australia, Singapore and many other countries.

Case Studies and Assignments Based on Real Scenarios – The Case Studies and Assignments assigned to the students are based on real-time scenarios out the Trainers Past Projects they were involved in.

Certification Assistance – During and at the end of training, the Sr. trainer will provide Certification questions and answers to help you clear the Certification (if required). They will guide each student the required Certification program as well as they themselves are Certified. Every student also receives a ZaranTech Training Completion Certificate as well.

Career Counseling – If you are New to IT and want career counseling to help you decide which stream to go into, please click the link, http://www.zarantech.com/free-career-counseling/ and fill out the Career Counseling form and one of our counselors will get in touch.

Placement Assistance – Our “After the training” team can also help you with Resume prep guidance, Interviews questions and Mock interviews after your training is complete.

Modules Covered in this Training

In this training, attendees learn:

  1. What is Big Data
  2. The Case for Apache Hadoop
  3. The Hadoop Distributed File System
  4. MapReduce
  5. An Overview of the Hadoop Ecosystem
  6. Planning your Hadoop Cluster
  7. Hadoop Installation
  8. Advanced Configuration
  9. Hadoop Security
  10. Managing and Scheduling Jobs
  11. Cluster Maintenance
  12. Cluster Monitoring and Troubleshooting
  13. Installing and Managing Other Hadoop Projects

Attendees also learn:

  1. Resume Preparation Guidelines and Tips
  2. Mock Interviews and Interview Preparation Tips

Topics Covered

What is Big Data?

  • Need for a different technique for Data Storage
  • Need for a different paradigm for Data Analysis
  • The 3 V’s of Big Data
  • Different distributions of Hadoop

The Case for Apache Hadoop

  • A Brief History of Hadoop
  • Core Hadoop Components
  • Fundamental Concepts
  • Hadoop Eco-Systems – Overview

The Hadoop Distributed File System

  • HDFS Features
  • HDFS Design Assumptions
  • Overview of HDFS Architecture
  • Writing and Reading Files
  • Hands-On Exercise

MapReduce

  • What Is MapReduce?
  • Features of MapReduce
  • Basic MapReduce Concepts
  • Architectural Overview
  • What is a Combiner?
  • What is a Practitioner?
  • Hands-On Exercise

An Overview of the Hadoop Ecosystem

  • What is the Hadoop Ecosystem?
  • Integration Tools
  • Analysis Tools
  • Data Storage and Retrieval Tools

Planning your Hadoop Cluster

  • General planning Considerations
  • Choosing the Right Hardware
  • Network Considerations
  • Configuring Nodes

Hadoop Installation

  • Deployment Types
  • Installing Hadoop
  • Basic Configuration Parameters
  • Hands-On Exercise on a Pseudo – Cluster
  • Hands-On Exercise on a Multi-Node Cluster

Advanced Configuration

  • Advanced Parameters
  • core-site.xml parameters
  • mapred-site.xml parameters
  • hdfs-site.xml parameters
  • Configuring Rack Awareness

Hadoop Security

  • Why Hadoop Security Is Important
  • Hadoop’ s Security System Concepts
  • What Kerberos Is and How it Works
  • Integrating a Secure Cluster with Other Systems

Managing and Scheduling Jobs

  • Managing Running Jobs
  • Hands-On Exercise
  • The FIFO Scheduler
  • The Fair Scheduler
  • The Capacity Scheduler
  • Configuring the Fair Scheduler
  • Evaluating the different schedulers
  • Hands-On Exercise

Cluster Maintenance

  • Checking HDFS Status
  • Hands-On Exercise
  • Copying Data Between Clusters
  • Adding and Removing Cluster Nodes
  • Rebalancing the Cluster
  • Hands-On Exercise
  • Name Node Metadata Backup
  • Cluster Upgrading

Cluster Monitoring and Troubleshooting

  • General System Monitoring
  • Managing Hadoop’s Log Files
  • Using the Name Node and Job Tracker Web UIs
  • Hands-On Exercise
  • Cluster Monitoring with Ganglia
  • Common Troubleshooting Issues
  • Benchmarking Your Cluster

Installing and Managing Other Hadoop Projects

  • Hive
  • Pig
  • Hbase
  • Oozie

About Trainer Venkat:

  • 18 years of assisting Staffing organizations in mentoring their candidates in J2EE, Spring, Web Services, SOA, Hadoop & its eco-system components.
  • 5 years of experience on Hadoop & its Eco-system [Pig, Hive, Sqoop, HBase] trainings and actively involved in mentoring organization in their Hadoop Usage Analysis and implementation.
  • Have mentored Microsoft India IT / Professional Services team on HD Insight and JPMC team members on Hadoop understanding and implementations.
  • 18 years of experience in corporate training, specializing in Java with specific emphasis on Spring Core, Spring MVC, Spring Security, Hibernate, and Integration aspects of application Development.
  • 8 years of experience in Savvion BPM Training to customers like IBM, e-Rewards, AT&T, Bell Canana, Sun Microsystems, ADP, Sandia Labs, Penske, PWC, Advanced Equities, Seagate, PayPal, Micron, DocHarbour, Visa, Anacomp, HBO, Citadel, Morgan Stanley, Reply Group (Italy), RBC Dexia (Canada), Rogers Communication (Canada), Virtusa (Sri Lanka), GVT (Brazil), Kernel (Mexico), Bank of America (Canada), Motorola (Germany), Assenda (Columbia), Telecom New Zealand.
  • 5+ years of experience in Mobility Trainings on J2ME, Android and 2 years into iPhone and iPad. Among the first to start grass-root level iOS programming and now expanded that to iPAD applications.
  • Have mentored and consulted organization in the application development using Spring Core, Spring MVC, AOP and ORM.
  • Have exposure in different phases of software development life cycle including Business Requirements, System Analysis, Documentation, Designing, Development, Issues & CR Management, Unit Testing & Integration Testing and Production Deployment and hence relate to the same in Training.
  • Constantly have got a participant satisfaction of more than 92% in all trainings.
  • Specialties: Java, Spring Core, Spring MVC, Android, Savvion BPM, Hibernate, Hadoop, HD Insight

CASE STUDY # 1 – “Healthcare System”

Healthcare System Application:

As the Product Manager for Inner Expressions you are asked to provide one of your largest clients with additional features in the EMR ( Electronic Medical Records Management) System. The client has requested an integrated Referral Management System that tracks patients from Primary care into the Specialist departments. Appointments are created by either the Primary Care Physicians themselves or other clinical staff like Nurse Practitioners or Clinical Assistants. Each appointment must go through the appropriate checks including checking if the patient has an active insurance with the client, whether the insurance program covers the condition of the patient, patient’s preference for location and timings and availability of the Specialist doctor.

Some appointments may have to be reviewed by the Specialists themselves before they can be approved, the administrator of the facility (hospital) must have the ability to choose by appointment type to either make it directly bookable by the Primary Care Staff or as a type that requires review by the specialist. The system should also allow the Primary Care Staff and specialists departments to exchange notes and comments about a particular appointment. If the specialist department requests tests or reports as mandatory for the appointment, the system must ensure that the patient has these available on the date of the appointment.

The system shall also allow users to track the status of patients’ appts & must store the entire clinical history of each patient. This will be used by the hospital for two main purposes; the specialist and the primary care providers will have access to the patients complete medical history before the patient walks in for the appt and hence allowing for better patient care, the Hospital also stores this data in a general data warehouse ( without Protected Health Information) to do analytics on it and come up with local disease management programs for the area. This is aligned with the Hospitals mission of providing top quality preventive medical care.

The Hospital sets about 300 appointments per day and must support about 50 users at the same time. The existing EMR system is based on Java and an Oracle database system.

TASKS

  • Identify Actors, Use Cases, Relationships,
  • Draw Use Case Diagrams
  • Identify Ideal, Alternate and Exception Flows
  • Write a Business Requirements Document

CASE STUDY # 2 – “Asset Management System”

Asset Management Application:

An e Examination system is also known as (e-Pariksha/ Online Examination Scheduler), an Intelligent Web Application which automates the process of pre examination scheduling of Any Academic Institutions, Universities, Colleges and School. This automations primary scope is to save nature by saving tons of paper involved in conducting the examination. All examination communications are done via email management between student and Academia. Usually any examination would start with Exam Registrations, which is connected to Subject Creation, Exam Room Management , Room Allotment, Examination Hall Dairy, and Absentees Information (Variety of Reports) – Required by UniversityThis WebApp edges two sides of Client side and Server side Application. Client side enables student community to fill up their examination registration form online via internet and also they have privileges to check out their examination details like (Day of Start, Complete Time Table, Day-wise Exam Details and Day seating details of the candidate- like room name, seating number subject, date and time. The Server side involves the processing of each candidate exam registration form into workflow like, Subject Loader, Room Management, Seating Manager, Room Allotment, Room Dairies, Absentee Marking, and Rich Crystal Reports to meet various needs of Data set.The WebApp Admin records new chattel into database, deletes archaic ones, and revises any information related to examination. “User”. All users are known to the system by their USN, ID and their The asset management system keeps track of a number of assets that can be borrowed, their ownership, their availability, their current location, the current borrower and the asset history. Assets include books, software, computers, and peripherals. Assets are entered in the database when acquired, deleted from the database when disposed. The availability is updated whenever it is borrowed or returned. When a borrower fails to return an asset on time, the asset management system sends a reminder to the borrower and informs the asset owner.

The administrator enters new assets in the database, deletes obsolete ones, and updates any information related to assets. The borrower search for assets in the database to determine their availability and borrows and returns assets. The asset owner loans assets to borrowers. Each system has exactly one administrator, one or more asset owners, and one or more borrowers. When referring to any of the above actor, we use the term “user”. All users are known to the system by their name and their email address. The system may keep track of other attributes such as the owner’s telephone number, title, address, and position in the organization.

The system should support at least 200 borrowers and 2000 assets. The system should be extensible to other types of assets. The system should checkpoint the state of the database every day such that it can be recovered in case of data loss. Owners and the administrator are authenticated using a user/password combination. Actors interact with the system via a web browser capable of rendering HTML and HTTP without support for JavaScript and Java.

The persistent storage is realized using an SQL database. The business logic is realized using the WebObjects runtime system. The system includes:

TASKS

  • Identify Actors, Use Cases, Relationships,
  • Draw Use Case Diagrams
  • Identify Ideal, Alternate and Exception Flows
  • Write a Business Requirements Document

OTHER CASE STUDIES:

Social Networking, Cruise Management System, Collegiate Sporting system

How to be a certified Hadoop Administrator?

Certification for Hadoop Administrator can be attained by the aspirant in the following steps:
Step 1: Once training is over, Registration must be done for Hadoop CCA-500 exam.
Step 2: Complete the exam and hence you shall be certified.

What are the requirements for the certification?

Basic knowledge of programming and Linux is required to pass this examination.

What is the cost of the examination for Hadoop Administrator certification?

Once the candidate registers on the website, he must pay for the certification exam. The cost is $295.

What are the pattern for the exam and the duration for the test?

Note: The examination would consist of 60 live questions, the duration for the examination is 90 minutes and the passing score is 70%.

Technical Requirements to take an Online training with ZaranTech

Technical Requirements for ZaranTech Online Classes:

  • Operating System: Windows XP or newer
  • Browser: Internet Explorer 6.x or newer
  • CPU: P350 MHz, recommended P500+ MHz
  • Memory: 128 MB, recommended 256+ MB RAM
  • Free Disk Space: 40 MB, recommended 200+ MB for content and recordings
  • Internet Connection: 28.8 Kbps, recommended 128+ Kbps
  • Monitor: 16 bit colors (high color)
  • Other: Sound card, microphone, and speakers OR headset with microphone

What is the Difference between Live training and Video training?

These Videos here will help you understand the difference,
VIDEO – What is Instructor led LIVE Training –http://www.youtube.com/watch?v=G908QvF-gVA
VIDEO – What is Instructor led VIDEO Training – http://www.youtube.com/watch?v=naPdAyKvAI0

Benefits of online training as compared to classroom training

Online Training Benefits
A constantly shifting and changing IT market requires IT professionals to do more with less, making use of new tools and solutions to move forward. Investment in learning and development enables growth in our changing information technology marketplace, giving you the knowledge and skills to act, behave, and perform your job differently. Instructor-led Online training can provide the learning solutions you need in a format that is cost-effective and convenient bringing the interactivity, expertise, and diverse curriculum of our traditional courses to your home or office utilizing state-of-the-art technology. This method of learning allows for live interaction with the trainer and fellow students, without the cost of travel or lodging expenses. To accommodate the demanding schedules of professionals that is trying to do more with less.

Some of the major benefits are :

  1. Full Interactivity –
    Two-way voice over internet and web-conferencing using Cisco WebEx Meeting Center tool. This tool enables participants to ask questions and collaborate with each other in an online virtual space and enables the online trainer to answer questions, take simulations, and receive answers instantaneously. Every trainee can view the trainers desktop and vice versa.
  2. Cost Savings and Convenience –
    Courses can be completed from home, the office, or wherever the Internet is accessible. There is no need to travel to a specific location to attend a training program. Less overhead cost for the company and the savings is passed on to the trainees. Shorter course schedules mean that projects don’t have to be put on hold while participants train (for corporations).
  3. Never Miss a Session –
    With online training, you can receive archived video recorded sessions to all enrollees and the streaming video recording links are posted on the Training blog after each session. Participants may view these sessions to review sessions post-class or make up a missed class as needed. Accesses to Video Recordings are available after the training end thus making it easy for you to review after training ends.
  4. Location Independent –
    You may join for an online instructor-led course from any part of the world without having to travel. Trainees can attend from USA, Canada, New Zealand, UK, Australia, India and many other countries around the world.
  5. Affordable –
    Classroom sessions are expensive. You pay for Hotel, Food, Travel plus Course Fees. All those overhead costs quickly add up to more than 5,000 dollars. Online training programs costs less and is a fraction of that cost of classroom training.
  6. Career Focused –
    The online IT training courses match the tasks, assignments or projects you perform for employers on the job guaranteeing that the new skills you gain after training are immediately relevant to your career or employer.
  7. Shorter Sessions –
    By providing shorter session duration and then providing assignments, gives the trainees time to understand the concepts and practice from the assignments and be prepared for the next session. Online training sessions are each 2-3 hrs long and only cover 10hrs per week. Classes are scheduled 2-3 days apart giving you time to practice.
  8. Computer-Aided Simulation Learning –

    A growing number of Online Training courses are utilizing computer-aided simulation. This feature allows you to learn by making critical decisions in a realistic and safe “virtual” business setting. The consequences of your actions can be comprehended immediately. It is a highly-effective method for realizing the potential short- and long-term benefits (or dangers) of specific actions and decisions. The lessons learned using simulation are entrenched in your mind and can be applied to your role immediately.

  9. Minimum Technical Requirements:
    • Operating System: Windows XP or newer
    • Browser: Internet Explorer 6.x or newer
    • CPU: P350 MHz, recommended P500+ MHz
    • Memory: 128 MB, recommended 256+ MB RAM
    • Free Disk Space: 40 MB, recommended 200+ MB for content and recordings
    • Internet Connection: 28.8 Kbps, recommended 128+ Kbps
    • Monitor: 16 bit colors (high color)
    • Others: Sound card, microphone, and speakers OR headset with microphone

How soon after I Enroll would I get access to the Training Program and Content?

Right after you have Enrolled, we will send you an Email to your Gmail id with a Video on How To login to the training blog and get access to the training program and content.

What are the pre-requisites of taking this training?

– Students, DBAs, System Administrators, Software Architects, Data Warehouse Professionals, IT Managers, and Software Developers interested in learning Hadoop Cluster Administration should go for this course.
– This course is designed for Systems Administrators and IT Managers who have basic Linux experience. No need for prior knowledge of Apache Hadoop.

Who are the instructors and what are their qualifications?

All our instructors are Senior Consultants themselves with a minimum of 10 years of real-time experience in their respective fields. Each trainer has also trained more than 100 students in the individual and/or corporate training programs.

How will be the practicals/assignments done?

Practicals/assignments will be done using the training blog. Instructions will be sent after you enroll.

When are the classes held and How many hours effort would I need to put in every day/week?

Online Live sessions are held weekdays evening CST (Central Standard Time GMT-6) or on Weekends. The schedule is posted for each batch on the website. You have to put in a effort of 8-10 hrs per week going thru the videos once again and completing your assignments.

What if I miss a class?

We Video record every Live session and after the session is complete, we will post the Video recording of that session in the blog. You will have access to these Video recordings for 6 months from the date you start your training. Material access will be provided using Google Drive Cloud for lifetime.

How can I request for a support session?

You can do that by posting a question on the training blog.

What If I have queries after I complete this course?

You can post those questions on the training blog.

Will I get 24*7 Support ?

You will get 24*7 accesss to the blog to post your questions. Trainers will answer your questions within 24 hrs of time. Normally they answer very frequently, like about 1-2 hrs. You can also approach your training coordinator for the same.

Can I get the recorded sessions of a class from some other batches before attending a live class?

Yes, you can. Or you can see our Youtube page for previous batch session recordings.

How will I get the recorded sessions?

It will be provided to you through the trainng blog.

How can I download the class videos?

You wont be able to download any videos. They are available for you to View using the training blog 24*7.

Is the course material accessible to the students even after the course training finishes?

Yes.

Do you provide Placements as well

We are infact, a Consulting company which provides training so we are mainly looking for trainees who are looking for Placement after training.
After the Training Process explained (Video): http://www.youtube.com/watch?v=BrBJjoH46VI
Our 6-step training to placement process (Video): http://www.youtube.com/watch?v=BrBJjoH46VI

How can I complete the course in a shorter Duration?

Enroll to our Self paced video training.
Video Explanation – What is Instructor led VIDEO Training – https://www.youtube.com/watch?v=v1P9_fkg9mE

Do you provide any Certification? If yes, what is the Certification process?

We provide Certification guidance at the end of each course. You will also receive a “Certificate of Completion” from ZaranTech at the end of the course.

Are these classes conducted via LIVE video streaming?

We have both the options available

What internet speed is required to attend the LIVE classes?

1Mbps of internet speed is recommended to attend the LIVE classes. However, we have seen people attending the classes from a much slower internet.

What are the payment options?

We accept Credit Cards, Paypal, Bank payments from anywhere in USA, Money orders, International Wire transfer, ACH transfers, Chase Quickpay, Bank of America transfers, Wellsfargo Surepay. All the payments details are mentioned on the Enrollment page.

What if I have more queries?

Call the number listed on the Course Details page of our website.