Open to new opportunities

Saad
Driouech

Machine Learning Engineer with a research background in generative AI and applied ML. M.Sc. thesis introduced the first spatial control framework for scale-wise autoregressive T2I generation. Experience spans diffusion models, autoregressive generation, RAG pipelines, and AI automation. 5 peer-reviewed publications. Based in Fürth, Germany.

✉ Get in touch ↓ Download CV ⚡ View projects GitHub Scholar

5Publications

3+Years ML exp.

1.1M.Sc. grade

4Languages

About me

Researcher. Engineer.
Builder.

I'm a Machine Learning Engineer with a research background in generative AI, NLP, and applied ML. M.Sc. in Artificial Intelligence from Friedrich-Alexander-Universität Erlangen-Nürnberg, grade 1.1 (passed with distinction). Currently working as a Generative AI Research Engineer at Fraunhofer IIS.

My M.Sc. thesis introduced the first spatial control framework for scale-wise autoregressive T2I generation: a ControlNet-style architecture over a frozen 2.5B-parameter backbone, six control modalities, and a unified model with learned modality routing. My work spans diffusion models for signal reconstruction, transformer pre-training for low-resource NLP, production RAG systems, and agentic LLM pipelines.

I have 5 peer-reviewed publications across NLP, TTS, and applied ML. I love the challenge of translating research insights into reliable, maintainable software.

🇬🇧 English — Fluent🇫🇷 French — Advanced🇲🇦 Arabic — Native🇩🇪 German — Learning

🧠

Deep Learning

Diffusion models, transformers, TTS: trained and evaluated across diverse domains and modalities.

🔗

LLM Systems

RAG pipelines, agentic workflows with LangGraph, multi-LLM orchestration, and streaming backends.

⚙️

MLOps & Automation

Airflow, n8n, FastAPI, Docker: end-to-end pipelines from data to deployed, monitored systems.

📄

Published Researcher

5 papers in IEEE, Springer, MDPI spanning NLP, TTS, and e-commerce ML.

Technical skills

Tools of the trade

🔬

ML & Deep Learning

PyTorchTensorFlowScikit-learnXGBoostDiffusion ModelsTransformersTensorBoard

💬

NLP & Speech

LLMsHugging FaceFine-tuning (LoRA/PEFT)TTS (FastSpeech 2)HiFi-GANspaCy

🤖

LLM Systems & RAG

LangChainLangGraphLlamaIndexRAGQdrantFAISSOpenAI APIAnthropic APIGroq API

⚡

Automation & Backend

n8nApache AirflowFastAPIREST APIsSSE StreamingDockerDocker Compose

🛠

Infrastructure

PythonSQLGitCI/CDLinux

🌍

Languages

English (Fluent)French (Advanced)Arabic (Native)German (Learning)

Personal projects

Things I've built

Side projects exploring the frontier of LLM systems, agentic AI, and ML automation.

🔬

M.Sc. Thesis · FAU · 2026

Spatially Controlled SWITTI

First spatial control framework for next-scale autoregressive T2I generation

Integrated spatial conditioning into SWITTI, a 2.5B-parameter scale-wise autoregressive T2I model. The only work addressing controllable T2I in the next-scale AR paradigm. Two novel architectures (31M and ~2.5B trainable params), six control modalities, and a unified model with learned modality routing that matches specialist performance within 10–15%.

PyTorchGenerative AIDINOv2ControlNet-styleAutoregressiveFAUFirst of its kind

Read the full case study

🔭

PaperLens

2026

RAG system for research papers with a three-stage retrieval pipeline: bi-encoder search (MiniLM + Qdrant), cross-encoder reranking (ms-marco), and diversity-aware selection ensuring all indexed sources are represented. Hybrid dense + BM25 search with Reciprocal Rank Fusion. Groq (Llama-3.3-70b) for cited answer generation. Served via Streamlit UI, FastAPI, and CLI.

RAGQdrantFastAPICross-encoder RerankingGroqHybrid Search (RRF)Streamlit

📰

CurateAI

(AI News Curation Bot)

2026

Airflow 4-task DAG (twice daily) that polls Discord reactions to build a preference profile, fetches AI news via Tavily with SQLite deduplication, scores articles with Claude + Instructor for structured output, and publishes Discord embeds with 👍/👎 buttons. User reactions are re-injected as few-shot examples into Claude's scoring prompt each run, making the bot more accurate without any retraining.

Apache AirflowClaude APIInstructorTavilyDiscord BotFew-shot LearningSQLiteDocker

🚀

ApplAI

2026

5-phase automated job application pipeline. Scrapes 4 job boards (Arbeitnow, Remotive, RemoteOK, Adzuna), pre-filters with 50+ AI/ML keywords, scores with Gemini 2.5 Flash, then generates LaTeX CVs & cover letters via Claude, compiled with pdflatex behind a LaTeX safety validator. Discord Approve/Reject/Rescue buttons (Ed25519-verified), Notion dashboard, and a preference feedback loop that refines future scoring. n8n 12h cron, security-first throughout.

n8nFastAPIGemini 2.5 FlashClaude APIDiscord BotLaTeX / pdflatexNotionEd25519Docker

✍️

MotiGen

2026

Agentic motivation letter generator built on a LangGraph 4-node pipeline with typed state: CV parsing → JD parsing → live company research (dual Tavily queries for culture + role context) → letter generation. Supports Claude, Groq, and OpenAI via a unified model factory. FastAPI backend with SSE streaming for real-time output; Streamlit UI for file-upload workflows.

LangGraphLangChainClaude / Groq / OpenAIFastAPI + SSETavilyPydantic v2Streamlit

Work experience

Where I've worked

Generative AI Research Engineer

Mar 2025 — Present

Fraunhofer IIS · Nuremberg, Germany · Working Student

Training diffusion-based generative models for GNSS signal reconstruction under real-world interference conditions; iterating on architectures and loss functions to stabilise training and prevent mode collapse.
Designed systematic evaluation frameworks measuring model robustness under noise and distribution shift; experimented with spectrogram and complex IQ data representations.
Managed the full experimental lifecycle: hypothesis formulation, implementation, ablation studies, and documentation. Tracked all experiments with TensorBoard.
Collaborated with signal processing engineers to translate experimental findings into concrete modelling decisions.

Applied Machine Learning Engineer

Dec 2023 — Feb 2025

August-Wilhelm Scheer Institut · Saarbrücken, Germany · Working Student

Built end-to-end ML pipelines for garment return prediction on cold-start products with no transaction history, achieving 86% balanced accuracy; addressed class imbalance and feature sparsity.
Refactored and parallelized preprocessing pipelines, achieving a 5× runtime speedup and significantly improving iteration speed and reproducibility.
Conducted feature importance analysis to identify key return drivers, enabling interpretable recommendations for business stakeholders.

Development Engineer

Jun 2023 — Aug 2023

Hightech Payment Systems · Casablanca, Morocco

Enhanced PowerCARD, HPS's global payment switching and card management platform, to meet VISA and Mastercard compliance requirements.
Worked with SQL databases, Docker containers, CI/CD pipelines, and Linux environments in a professional engineering setting.

Applied NLP Research Engineer

Sep 2022 — May 2023

Al Akhawayn University · Ifrane, Morocco · Part-time

Pre-trained two transformer language models (DarELECTRA 52M, DarRoBERTa 80M) for low-resource Moroccan Darija on a 1 GB code-mixed corpus; fine-tuned on three downstream tasks.
Text summarization: DarELECTRA achieved ROUGE-1 19.25 / ROUGE-L 18.01, state-of-the-art among all tested models including ARBERT and MARBERT.
Topic classification: F1 0.84 / accuracy 0.86; offensive language detection: 90% accuracy / 85% F1; published at IEEE CiSt 2023 and MDPI 2024.

Applied ML Engineer (Intern)

May 2022 — Jul 2022

Wenov, Attijariwafa Bank Innovation Lab · Casablanca, Morocco

Built a transformer-based intent classification system to automatically route multilingual client inquiries to relevant departments, replacing a legacy rule-based system.
Outperformed traditional ML baselines by +7% accuracy / +4% F1; evaluated across multiple language variants and edge cases.

Education

Academic background

M.Sc. Artificial Intelligence

Friedrich-Alexander-Universität Erlangen-Nürnberg

Grade 1.1 · Passed with DistinctionOct 2023 — May 2026

Thesis: Finetuning Visual Autoregressive Models for Controllable Image Generation. First spatial control framework for scale-wise autoregressive T2I generation — ControlNet-style architecture over a frozen 2.5B-parameter SWITTI backbone, six control modalities, unified model with learned modality routing.

Pattern RecognitionML for Time SeriesAdvanced Programming

B.Sc. Computer Science

Al Akhawayn University

GPA 3.83 / 4.0 · Summa Cum LaudeSep 2018 — Jul 2022

Capstone: Darija Text-to-Speech Synthesis using FastSpeech 2 + HiFi-GAN on a 2-hour low-resource dataset. MOS 3.905; published at ICDTA 2025.

Probability & StatisticsDatabase SystemsBig DataData Structures

Publications

Peer-reviewed work

[1]

Let's connect

Open to full-time roles, research collaborations, and interesting problems. Based in Fürth, Germany, open to relocation.

Whether you're hiring, want to collaborate on a project, or just want to talk AI, my inbox is open.

LinkedInlinkedin.com/in/saad-driouech

Google Scholar5 publications · citations

GitHubgithub.com/Saad-Driouech

Saad
Driouech