Data Engineer - LLM Pipeline & Data Infrastructure - Amsterdam - ref. b33861915

apartmentVox AI placeAmsterdam business_center€ 60.000 - € 80.000/jaar calendar_month 

We're building an AI-powered conversational system for drive-thru automation. As our Data Engineer, you'll design and implement the infrastructure that powers our multi-stage LLM pipeline, from data capture to processing, model training, and deployment.

Tasks
  • Build scalable real-time data pipelines for audio processing, LLM interactions, and model training
  • Design comprehensive data storage solutions across object storage, NoSQL, and analytical databases
  • Implement data quality management with filtering, normalization, and enrichment capabilities
  • Create automated processes for data preparation, model evaluation, and continuous improvement
  • Develop observability systems with monitoring, alerting, and performance dashboards
  • Establish data security and compliance protocols, including privacy protection measures
  • Build resilient data systems with error recovery, backup, and integrity verification

Requirements

What You'll Need
  • Experience designing data pipelines for AI/ML applications
  • Expertise with Apache Airflow for workflow orchestration
  • Strong knowledge of Apache Spark for large-scale data processing
  • Experience with Apache Kafka for real-time event streaming
  • Proficiency with object storage systems (S3/MinIO) and database technologies (Cassandra/ScyllaDB, ClickHouse)
  • Understanding of monitoring tools (OpenTelemetry) and observability platforms
  • Experience implementing data security and compliance measures
  • Advanced Python programming skills
Preferred Experience
  • Audio data processing and conversational AI systems
  • LLM training and fine-tuning pipelines
  • Data quality frameworks (Great Expectations) and versioning tools (LakeFS, DVC)
  • Kubernetes for container orchestration
  • Multi-region deployment and distributed systems
Benefits
  • Build cutting-edge conversational AI systems with real-world impact
  • Work with modern, open-source technology stack
  • Help shape the future of automated customer service
  • Competitive compensation and flexible work arrangements

If you're passionate about building robust data systems for AI applications and excited by complex real-time data challenges, we'd love to talk.

local_fire_departmentDringend gezocht

Senior Data Engineer

apartmentGrowth Accelerator Staffing B.V.placeAmsterdam
Working as a freelance consultant Startup Accelerator offers Data & AI consultants a kickstart to becoming independent, freelance consultants. We've helped Data Scientists, Machine Learning Engineers, MLOPs Engineers, Data Engineers, and Cloud...
electric_boltDirect beginnen

Azure Data Engineer

placeAmsterdam
sector? Bij Brunel werk je als Azure Data Engineer aan innovatieve cloudprojecten. Je groeit door middel van training, kennisdeling en samenwerking met ervaren collega's. Of je nu starter bent of al ervaring hebt: jouw ontwikkeling staat centraal...
check_circleNieuwe vacature

Senior Data Engineer

apartmentRandstadplaceAmsterdam
Wat je gaat doen Als Data Engineer (of Datawarehouse specialist, BI developer) ben je werkzaam bij de afdeling Financieel Economische Zaken(FEZ) Business Intelligence. Deze afdeling is verantwoordelijk voor het verhogen van het BI niveau binnen...