
Big Data Introduction training course

Leverage big data analysis tools and techniques to foster better business decision-making

JBI training course London UK

"Our tailored course provided a well rounded introduction and also covered some intermediate level topics that we needed to know. Clive gave us some best practice ideas and tips to take away. Fast paced but the instructor never lost any of the delegates"

Brian Leek, Data Analyst, May 2022

TAILOR-MADE
Enquire & get a quote
PUBLIC COURSES
Next on 18 Jul - see prices

  • Gain an introduction to Big Data
  • Learn how to define Big Data
  • Select the correct Big Data stores for disparate data sets
  • Process large data sets using Hadoop to extract value
  • Store, manage and analyse unstructured data
  • Leverage Big Data analysis tools and techniques to foster better business decision-making
  • Query large data sets in near real-time with Pig and Hive
  • Plan and implement a Big Data strategy for your organisation

Introduction to Big Data

  • Defining Big Data
  • The four dimensions of Big Data: volume, velocity, variety, veracity
  • Introducing the Storage, MapReduce and Query Stack
  • Delivering business benefit from Big Data
  • Establishing the business importance of Big Data
  • Addressing the challenge of extracting useful data
  • Integrating Big Data with traditional data

Storing Big Data

  • Analysing your data characteristics
  • Selecting data sources for analysis
  • Eliminating redundant data
  • Establishing the role of NoSQL

Overview of Big Data stores

  •  Data models: key value, graph, document, column–family
  •  Hadoop Distributed File System
  •  HBase
  •  Hive
  •  Cassandra
  •  Hypertable
  •  Amazon S3
  •  BigTable
  •  DynamoDB
  •  MongoDB
  •  Redis
  •  Riak
  •  Neo4J
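
As a rough illustration of the four data models listed above, the sketch below holds the same customer in each shape using plain Python structures. The record, field names and keys are invented for illustration; no real store's API is used, and actual products differ in detail.

```python
import json

# Hypothetical customer record, invented for this illustration.

# Key-value: an opaque value looked up by a single key.
key_value = {"customer:42": json.dumps({"name": "Ada", "city": "London"})}

# Document: a nested, self-describing record that can be queried by field.
document = {
    "_id": 42,
    "name": "Ada",
    "address": {"city": "London"},
    "orders": [{"sku": "A1", "qty": 2}],
}

# Column-family: row key -> column family -> column -> value.
column_family = {
    "42": {
        "profile": {"name": "Ada", "city": "London"},
        "orders": {"A1": 2},
    }
}

# Graph: nodes plus labelled edges that describe relationships.
graph = {
    "nodes": {"customer:42": {"name": "Ada"}, "product:A1": {"sku": "A1"}},
    "edges": [("customer:42", "ORDERED", "product:A1")],
}


def products_ordered_by(graph_data, customer_id):
    """Follow ORDERED edges from one customer node to product nodes."""
    return [
        target
        for source, label, target in graph_data["edges"]
        if source == customer_id and label == "ORDERED"
    ]
```

The key-value form is only reachable through its key, the document and column-family forms expose individual fields, and the graph form makes the relationship itself a first-class item.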

Selecting Big Data stores

  •  Choosing the correct data stores based on your data characteristics
  •  Moving code to data
  •  Implementing polyglot data store solutions
  •  Aligning business goals to the appropriate data store

Processing Big Data

  • Integrating disparate data stores
  • Mapping data to the programming framework
  • Connecting and extracting data from storage
  • Transforming data for processing
  • Subdividing data in preparation for Hadoop MapReduce
  • Employing Hadoop MapReduce
  • Creating the components of Hadoop MapReduce jobs
  • Distributing data processing across server farms
  • Executing Hadoop MapReduce jobs
  • Monitoring the progress of job flows
  • The building blocks of Hadoop MapReduce
  •  Distinguishing Hadoop daemons
  •  Investigating the Hadoop Distributed File System
  •  Selecting appropriate execution modes: local, pseudo–distributed and fully distributed
  • Handling streaming data
  • Comparing real–time processing models
  •  Leveraging Storm to extract live events
  •  Lightning–fast processing with Spark and Shark
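
The map, shuffle and reduce steps behind a MapReduce job can be sketched in a few lines of plain Python. This is a single-process illustration of the programming model only, not Hadoop's API: the function names (map_phase, shuffle, reduce_phase, word_count) are hypothetical, and a real cluster would run the map and reduce steps in parallel across many machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(line):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, as a framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Collapse all the values collected for one key into a single total."""
    return key, sum(values)


def word_count(lines):
    """Run map, shuffle and reduce in sequence over an iterable of lines."""
    mapped = chain.from_iterable(map_phase(line) for line in lines)
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())
```

For example, word_count(["big data", "Big Data tools"]) counts "big" and "data" twice each and "tools" once.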

Tools and Techniques to Analyse Big Data

  • Abstracting Hadoop MapReduce jobs with Pig
  • Communicating with Hadoop in Pig Latin
  •  Executing commands using the Grunt Shell
  •  Streamlining high–level processing
  • Performing ad hoc Big Data querying with Hive
  • Persisting data in the Hive Metastore
  • Performing queries with HiveQL
  • Investigating Hive file formats
  • Creating business value from extracted data
  • Mining data with Mahout
  •  Visualising processed results with reporting tools
  • Querying in real time with Impala

Developing a Big Data Strategy

  • Defining a Big Data strategy for your organisation
  •     Establishing your Big Data needs
  •     Meeting business goals with timely data
  •     Evaluating commercial Big Data tools
  •     Managing organisational expectations
  • Enabling analytic innovation
  •     Focusing on business importance
  •     Framing the problem
  •     Selecting the correct tools
  •     Achieving timely results

Implementing a Big Data Solution

  •     Selecting suitable vendors and hosting options
  •     Balancing costs against business value
  •     Keeping ahead of the curve

IT professionals who want to learn how to implement and enhance a corporate Big Data environment and gain elementary practical skills in Big Data


4.8 out of 5 average





CONTACT
+44 (0)20 8446 7555

enquiries@jbinternational.co.uk

JB International Training Ltd  -  Company number 08458005

Registered address Wohl Enterprise Hub 2B Redbourne Avenue London N3 2BS
