Cloudera experience:
- Cloudera platform installation
- Performance tuning experience in Apache Spark / Pyspark, Spark SQL
- Cluster configuration, maintenance, troubleshooting and monitoring
- CDP Private Cloud Base Installation
- HDFS Topology and Roles, HDFS Performance and Fault Tolerance, HDFS and Hadoop Security Overview
- Data Ingest - Ingesting Data, Best Practices for Importing Data
- Data Flow - NiFi Architecture, Apache Kafka, Apache Kafka Cluster Architecture, Apache Kafka Command Line Tools
- Data Access and Discovery - Apache Hive, Apache Impala, Apache Impala Tuning, Managing and Configuring Hue, Hue Authentication and Authorization
- Data Compute - Running Applications on YARN, MapReduce Applications, YARN Memory, and CPU Settings, Tez, ACID, Spark
- Managing Resources - Capacity Scheduler, Managing Queues, Impala Query Scheduling
- Security - Data Governance with SDX, Hadoop Security, Hadoop Authentication Using Kerberos, Hadoop Encryption, Apache Ranger, Apache Atlas, Backup and Recovery
- Private Cloud Capabilities
Didn’t find the job appropriate? Report this Job