Databricks Engineer

Plain Concepts
Brazil · Remote · Competitive salary · Posted 2 days ago
English required · Data Engineer / BI

We're looking for a hands-on Databricks Engineer to help design, build, and scale a modern data platform running on Apache Spark and Delta Lake. This role sits at the intersection of data engineering, platform architecture, and performance optimization. You'll work closely with data scientists, analysts, and backend teams to ensure reliable, high-performance data pipelines and well-governed datasets.

Responsibilities

  • Design and implement end-to-end data pipelines using Databricks (Jobs, Workflows, Delta Live Tables)
  • Build and maintain scalable ETL/ELT processes leveraging Apache Spark (PySpark / Scala)
  • Develop data models using Delta Lake, including schema design, partitioning strategies, Z-ordering, and optimization techniques
  • Manage and optimize Databricks clusters (autoscaling, spot instances, instance pools, cluster policies)
  • Implement CI/CD pipelines for Databricks deployments (e.g., using Databricks Repos, Terraform, Azure DevOps / GitHub Actions)
  • Work with structured and semi-structured data (JSON, Parquet, Avro) at scale
  • Ensure data quality and reliability through validation frameworks, unit/integration testing, and monitoring
  • Implement data governance practices (Unity Catalog, access controls, lineage tracking, auditing)
  • Troubleshoot performance issues (job failures, skew, shuffle bottlenecks, memory pressure) and optimize Spark workloads
  • Integrate Databricks with cloud-native services (AWS S3, Azure Data Lake Storage, GCP BigQuery)
  • Collaborate with data consumers to define SLAs, data contracts, and service interfaces

Requirements

  • Strong experience with Databricks (production workloads, not just notebooks)
  • Deep understanding of Apache Spark internals (execution plan, Catalyst optimizer, Tungsten engine)
  • Proficiency in PySpark (preferred) or Scala
  • Solid knowledge of Delta Lake (ACID transactions, time travel, compaction, OPTIMIZE, VACUUM)
  • Experience with distributed data processing and large-scale datasets (TB+ scale)
  • Familiarity with orchestration tools (Databricks Workflows, Airflow, or similar)
  • Experience with version control and CI/CD pipelines
  • Knowledge of cloud platforms (AWS / Azure / GCP), including IAM and storage services
  • Strong SQL skills and understanding of data warehousing concepts
  • Experience with data modeling techniques (star schema, medallion architecture)

Nice to Have

  • Experience with streaming pipelines (Structured Streaming, Auto Loader)
  • Knowledge of ML workflows on Databricks (MLflow, feature stores)
  • Infrastructure-as-Code experience (Terraform, ARM, CloudFormation)
  • Exposure to Unity Catalog and data governance frameworks
  • Experience with cost optimization strategies in Databricks environments
  • Familiarity with dbt or similar transformation tools

Application managed by Plain Concepts