Data Engineer - LLM Pipeline & Data Infrastructure - Amsterdam - ref. b33861915
Vox AI Amsterdam € 60.000 - € 80.000/jaar
We're building an AI-powered conversational system for drive-thru automation. As our Data Engineer, you'll design and implement the infrastructure that powers our multi-stage LLM pipeline, from data capture to processing, model training, and deployment.
Tasks- Build scalable real-time data pipelines for audio processing, LLM interactions, and model training
- Design comprehensive data storage solutions across object storage, NoSQL, and analytical databases
- Implement data quality management with filtering, normalization, and enrichment capabilities
- Create automated processes for data preparation, model evaluation, and continuous improvement
- Develop observability systems with monitoring, alerting, and performance dashboards
- Establish data security and compliance protocols, including privacy protection measures
- Build resilient data systems with error recovery, backup, and integrity verification
Requirements
What You'll Need- Experience designing data pipelines for AI/ML applications
- Expertise with Apache Airflow for workflow orchestration
- Strong knowledge of Apache Spark for large-scale data processing
- Experience with Apache Kafka for real-time event streaming
- Proficiency with object storage systems (S3/MinIO) and database technologies (Cassandra/ScyllaDB, ClickHouse)
- Understanding of monitoring tools (OpenTelemetry) and observability platforms
- Experience implementing data security and compliance measures
- Advanced Python programming skills
- Audio data processing and conversational AI systems
- LLM training and fine-tuning pipelines
- Data quality frameworks (Great Expectations) and versioning tools (LakeFS, DVC)
- Kubernetes for container orchestration
- Multi-region deployment and distributed systems
- Build cutting-edge conversational AI systems with real-world impact
- Work with modern, open-source technology stack
- Help shape the future of automated customer service
- Competitive compensation and flexible work arrangements
If you're passionate about building robust data systems for AI applications and excited by complex real-time data challenges, we'd love to talk.
Growth Accelerator Staffing B.V.Amsterdam
Working as a freelance consultant
Startup Accelerator offers Data & AI consultants a kickstart to becoming independent, freelance consultants. We've helped Data Scientists, Machine Learning Engineers, MLOPs Engineers, Data Engineers, and Cloud...
Amsterdam
sector? Bij Brunel werk je als Azure Data Engineer aan innovatieve cloudprojecten. Je groeit door middel van training, kennisdeling en samenwerking met ervaren collega's. Of je nu starter bent of al ervaring hebt: jouw ontwikkeling staat centraal...
RandstadAmsterdam
Wat je gaat doen
Als Data Engineer (of Datawarehouse specialist, BI developer) ben je werkzaam bij de afdeling Financieel Economische Zaken(FEZ) Business Intelligence. Deze afdeling is verantwoordelijk voor het verhogen van het BI niveau binnen...