(Retired) Performing Data Engineering on Microsoft HD Insight


The main purpose of the course is to give students the ability to plan and implement big data workflows on HDInsight.

Audience Profile

The primary audience for this course is data engineers, data architects, data scientists, and data developers who plan to implement big data engineering workflows on HDInsight.


In addition to their professional experience, students who attend this course should have:
  • programming experience using R, and familiarity with common R packages
  • knowledge of common statistical methods and data analysis best practices
  • basic knowledge of the Microsoft Windows operating system and its core functionality
  • working knowledge of relational databases.


At the end of the course, students will be able to:

  • deploy HDInsight Clusters
  • authorise Users to Access Resources
  • load Data into HDInsight
  • troubleshoot HDInsight
  • implement Batch Solutions
  • design Batch ETL Solutions for Big Data with Spark
  • analyse Data with Spark SQL
  • analyse Data with Hive and Phoenix
  • describe Stream Analytics
  • implement Spark Streaming Using the DStream API
  • develop Big Data Real-Time Processing Solutions with Apache Storm
  • build Solutions that use Kafka and HBase.

Onsite training?

If you need training for three or more people, ask us about training at your site. You can enjoy the convenience of reduced travel cost and time, as well as a familiar environment for your staff. Additionally, we can customise the course for your business needs.

Course Info

  • Code: 20775
  • Duration: 5 Days
  • Price: Call for price
