Data Engineer
. Apply strong understanding of Apache Spark architecture (executors, partitions, shuffles, joins, caching) to improve performance Partner...
. Apply strong understanding of Apache Spark architecture (executors, partitions, shuffles, joins, caching) to improve performance Partner...
) Experience with tools such as Spark, Python, Scala, Linux Shell, Autosys, etc. Familiarity with interfaces like Linux Command...
, or education. Experience with Big Data Platforms (Source, Conform, Curate). Proficiency in tools such as Spark, Python, Scala...
supporting NextGen Platforms built around Big Data Technologies, including Hadoop, Spark, Kafka, Impala, HBase, and Docker... Hadoop components such as HDFS, Sentry, HBase, Kafka, Impala, SOLR, Hue, Spark, Hive, YARN, Zookeeper, and Postgres...
transformations using Python + PySpark and optimize performance for large datasets Apply strong understanding of Apache Spark... Strong hands-on development in Python, PySpark / Apache Spark, and Advanced SQL Experience working with large-scale data sets...
transformations using Python + PySpark and optimize performance for large datasets Apply strong understanding of Apache Spark... Strong hands-on development in Python, PySpark / Apache Spark, and Advanced SQL Experience working with large-scale data sets...
, military experience, or education. Strong hands-on development in Python, PySpark/Apache Spark, and advanced SQL. Experience... transformations using Python PySpark and optimize performance for large datasets. Apply a strong understanding of Apache Spark...
, training, military experience, or education. Strong hands-on development experience in Python, PySpark/Apache Spark... and optimize performance for large datasets. Apply strong understanding of Apache Spark architecture to improve performance...
such as Spark, Hadoop, Python, and Scala Architect and deploy big data workloads using Amazon EMR on EKS (Kubernetes) Build... with Spark, Hadoop, Hive, Trino Expertise handling large-scale (petabyte-level) datasets Deep understanding of performance...
, MongoDB, Hadoop, Cloudera, Spark, or Teradata Expertise in Linux and container platforms (Kubernetes) Experience with cloud...