Open to new opportunities

Saad
Driouech

Machine Learning Engineer with a research background in generative AI and applied ML. M.Sc. thesis introduced the first spatial control framework for scale-wise autoregressive T2I generation. Experience spans diffusion models, autoregressive generation, RAG pipelines, and AI automation. 5 peer-reviewed publications. Based in Fürth, Germany.

5Publications
3+Years ML exp.
1.2M.Sc. grade
4Languages
Saad Driouech

About me

Researcher. Engineer.
Builder.

I'm a Machine Learning Engineer with a research background in generative AI, NLP, and applied ML. Currently completing my M.Sc. in Artificial Intelligence at Friedrich-Alexander-Universität Erlangen-Nürnberg (grade 1.2, top 1%), while working as a Generative AI Research Engineer at Fraunhofer IIS.

My M.Sc. thesis introduced the first spatial control framework for scale-wise autoregressive T2I generation: a ControlNet-style architecture over a frozen 2.5B-parameter backbone, six control modalities, and a unified model with learned modality routing. My work spans diffusion models for signal reconstruction, transformer pre-training for low-resource NLP, production RAG systems, and agentic LLM pipelines.

I have 5 peer-reviewed publications across NLP, TTS, and applied ML. I love the challenge of translating research insights into reliable, maintainable software.

🇬🇧 English — Fluent🇫🇷 French — Advanced🇲🇦 Arabic — Native🇩🇪 German — Learning
🧠

Deep Learning

Diffusion models, transformers, TTS: trained and evaluated across diverse domains and modalities.

🔗

LLM Systems

RAG pipelines, agentic workflows with LangGraph, multi-LLM orchestration, and streaming backends.

⚙️

MLOps & Automation

Airflow, n8n, FastAPI, Docker: end-to-end pipelines from data to deployed, monitored systems.

📄

Published Researcher

5 papers in IEEE, Springer, MDPI spanning NLP, TTS, and e-commerce ML.

Technical skills

Tools of the trade

🔬

ML & Deep Learning

PyTorchTensorFlowScikit-learnXGBoostDiffusion ModelsTransformersTensorBoard
💬

NLP & Speech

LLMsHugging FaceFine-tuning (LoRA/PEFT)TTS (FastSpeech 2)HiFi-GANspaCy
🤖

LLM Systems & RAG

LangChainLangGraphLlamaIndexRAGQdrantFAISSOpenAI APIAnthropic APIGroq API

Automation & Backend

n8nApache AirflowFastAPIREST APIsSSE StreamingDockerDocker Compose
🛠

Infrastructure

PythonSQLGitCI/CDLinux
🌍

Languages

English (Fluent)French (Advanced)Arabic (Native)German (Learning)

Personal projects

Things I've built

Side projects exploring the frontier of LLM systems, agentic AI, and ML automation.

🔬
M.Sc. Thesis · FAU · 2026

Spatially Controlled SWITTI

First spatial control framework for next-scale autoregressive T2I generation

Integrated spatial conditioning into SWITTI, a 2.5B-parameter scale-wise autoregressive T2I model. The only work addressing controllable T2I in the next-scale AR paradigm. Two novel architectures (31M and ~2.5B trainable params), six control modalities, and a unified model with learned modality routing that matches specialist performance within 10–15%.

PyTorchGenerative AIDINOv2ControlNet-styleAutoregressiveFAUFirst of its kind
Read the full case study
🔭

PaperLens

2026

RAG system for research papers with a three-stage retrieval pipeline: bi-encoder search (MiniLM + Qdrant), cross-encoder reranking (ms-marco), and diversity-aware selection ensuring all indexed sources are represented. Hybrid dense + BM25 search with Reciprocal Rank Fusion. Groq (Llama-3.3-70b) for cited answer generation. Served via Streamlit UI, FastAPI, and CLI.

RAGQdrantFastAPICross-encoder RerankingGroqHybrid Search (RRF)Streamlit
📰

CurateAI

(AI News Curation Bot)
2026

Airflow 4-task DAG (twice daily) that polls Discord reactions to build a preference profile, fetches AI news via Tavily with SQLite deduplication, scores articles with Claude + Instructor for structured output, and publishes Discord embeds with 👍/👎 buttons. User reactions are re-injected as few-shot examples into Claude's scoring prompt each run, making the bot more accurate without any retraining.

Apache AirflowClaude APIInstructorTavilyDiscord BotFew-shot LearningSQLiteDocker
🚀

ApplAI

2026

5-phase automated job application pipeline. Scrapes 4 job boards (Arbeitnow, Remotive, RemoteOK, Adzuna), pre-filters with 50+ AI/ML keywords, scores with Gemini 2.5 Flash, then generates LaTeX CVs & cover letters via Claude, compiled with pdflatex behind a LaTeX safety validator. Discord Approve/Reject/Rescue buttons (Ed25519-verified), Notion dashboard, and a preference feedback loop that refines future scoring. n8n 12h cron, security-first throughout.

n8nFastAPIGemini 2.5 FlashClaude APIDiscord BotLaTeX / pdflatexNotionEd25519Docker
✍️

MotiGen

2026

Agentic motivation letter generator built on a LangGraph 4-node pipeline with typed state: CV parsing → JD parsing → live company research (dual Tavily queries for culture + role context) → letter generation. Supports Claude, Groq, and OpenAI via a unified model factory. FastAPI backend with SSE streaming for real-time output; Streamlit UI for file-upload workflows.

LangGraphLangChainClaude / Groq / OpenAIFastAPI + SSETavilyPydantic v2Streamlit

Work experience

Where I've worked

Generative AI Research Engineer

Mar 2025 — Present

Fraunhofer IIS · Nuremberg, Germany · Working Student

  • Training diffusion-based generative models for GNSS signal reconstruction under real-world interference conditions; iterating on architectures and loss functions to stabilise training and prevent mode collapse.
  • Designed systematic evaluation frameworks measuring model robustness under noise and distribution shift; experimented with spectrogram and complex IQ data representations.
  • Managed the full experimental lifecycle: hypothesis formulation, implementation, ablation studies, and documentation. Tracked all experiments with TensorBoard.
  • Collaborated with signal processing engineers to translate experimental findings into concrete modelling decisions.

Applied Machine Learning Engineer

Dec 2023 — Feb 2025

August-Wilhelm Scheer Institut · Saarbrücken, Germany · Working Student

  • Built end-to-end ML pipelines for garment return prediction on cold-start products with no transaction history, achieving 86% balanced accuracy; addressed class imbalance and feature sparsity.
  • Refactored and parallelized preprocessing pipelines, achieving a 5× runtime speedup and significantly improving iteration speed and reproducibility.
  • Conducted feature importance analysis to identify key return drivers, enabling interpretable recommendations for business stakeholders.

Development Engineer

Jun 2023 — Aug 2023

Hightech Payment Systems · Casablanca, Morocco

  • Enhanced PowerCARD, HPS's global payment switching and card management platform, to meet VISA and Mastercard compliance requirements.
  • Worked with SQL databases, Docker containers, CI/CD pipelines, and Linux environments in a professional engineering setting.

Applied NLP Research Engineer

Sep 2022 — May 2023

Al Akhawayn University · Ifrane, Morocco · Part-time

  • Pre-trained two transformer language models (DarELECTRA 52M, DarRoBERTa 80M) for low-resource Moroccan Darija on a 1 GB code-mixed corpus; fine-tuned on three downstream tasks.
  • Text summarization: DarELECTRA achieved ROUGE-1 19.25 / ROUGE-L 18.01, state-of-the-art among all tested models including ARBERT and MARBERT.
  • Topic classification: F1 0.84 / accuracy 0.86; offensive language detection: 90% accuracy / 85% F1; published at IEEE CiSt 2023 and MDPI 2024.

Applied ML Engineer (Intern)

May 2022 — Jul 2022

Wenov, Attijariwafa Bank Innovation Lab · Casablanca, Morocco

  • Built a transformer-based intent classification system to automatically route multilingual client inquiries to relevant departments, replacing a legacy rule-based system.
  • Outperformed traditional ML baselines by +7% accuracy / +4% F1; evaluated across multiple language variants and edge cases.

Education

Academic background

M.Sc. Artificial Intelligence

Friedrich-Alexander-Universität Erlangen-Nürnberg

Grade 1.2 · Top 1%Oct 2023 — May 2026

Thesis: Finetuning Visual Autoregressive Models for Controllable Image Generation. First spatial control framework for scale-wise autoregressive T2I generation — ControlNet-style architecture over a frozen 2.5B-parameter SWITTI backbone, six control modalities, unified model with learned modality routing.

Pattern RecognitionML for Time SeriesAdvanced Programming

B.Sc. Computer Science

Al Akhawayn University

GPA 3.83 / 4.0 · Summa Cum LaudeSep 2018 — Jul 2022

Capstone: Darija Text-to-Speech Synthesis using FastSpeech 2 + HiFi-GAN on a 2-hour low-resource dataset. MOS 3.905; published at ICDTA 2025.

Probability & StatisticsDatabase SystemsBig DataData Structures

Contact

Let's connect

Open to full-time roles, research collaborations, and interesting problems. Based in Fürth, Germany, open to relocation.

Whether you're hiring, want to collaborate on a project, or just want to talk AI, my inbox is open.