AI Engineer · Cottbus, Germany

I build AI systems that are meant to be used.

Two years building the backend and infrastructure behind an industrial-IoT AI platform at Perinet, plus a set of my own projects going deep on RAG, agents and LLMOps. I care less about the latest model and more about whether a system measurably works, and whether I can show that it does.


Available full-time from summer 2026 · open to relocate

Stack · Python, LangGraph, FastAPI, Docker/K8s
M.Sc. AI · BTU Cottbus
~3 yrs · professional software

01 About // who & what

I'm an AI Engineer focused on building systems that work in the real world — reliable tools people use, not demos that impress once and break.

At Perinet (industrial IoT) I've spent two years on the backend and infrastructure side of their AI platform: Python and Go services wired into real-time MQTT sensor streams, FastAPI REST APIs, Docker and Kubernetes, CI/CD. I also ran the model benchmarking that shaped the platform's design decisions.

The deeper RAG, agentic and LLMOps work I've driven through my own projects: hybrid retrieval with RAGAs and MLflow evaluation, multi-agent systems on LangGraph, and an observability dashboard that traces every model call's latency and cost.

I'm finishing my M.Sc. in Artificial Intelligence at BTU Cottbus, with close to three years of professional software experience overall, including earlier enterprise work at Cognizant. Open to AI / ML / LLM Engineer roles across Germany.

M.Sc. · Artificial Intelligence, BTU

~3 yrs · professional software

2 yrs · AI engineering at Perinet

open-source projects

02 Experience // where I've worked

Jul 2024 — May 2026

AI Engineer (Working Student)

Perinet GmbH · Cottbus, Germany

Built Python and Go backend services connecting LLM workflows to real-time MQTT sensor streams: versioned FastAPI REST APIs, containerized with Docker and deployed on Kubernetes, with automated GitHub Actions CI/CD for zero-touch deployments.
Led the containerization workstream for the company's AI platform — a chatbot, real-time MQTT anomaly detection, and a sensor-data exploration tool — handling profiling, QA, and deployment integration with the engineering and ML teams.
Owned the model-benchmarking workstream: evaluated retrieval speed, generation quality, and trade-offs across model variants to inform the platform's chatbot and corpus design.

Oct 2021 — May 2022

Software Engineer Trainee

Cognizant Technology Solutions · India

Built and maintained enterprise banking applications (COBOL, JCL, DB2) in agile sprints; developed structured debugging and production-deployment practices.

03 Projects // selected open source

GraphRAG Agent

Builds a knowledge graph from documents, then answers multi-hop questions by traversing the graph instead of flattening the documents into chunks: entity/relation extraction, k-hop subgraph retrieval, grounded answers with citations.

knowledge graph · k-hop retrieval · cited answers

GraphRAG · networkx · Pydantic · LLM extraction · Python
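
For readers unfamiliar with k-hop subgraph retrieval, here is a minimal sketch of the idea, not the project's actual code. It assumes the graph is stored as (head, relation, tail) triples and uses plain Python instead of networkx to stay dependency-free; the function name `k_hop_subgraph` and the toy entities are hypothetical.

```python
from collections import deque


def k_hop_subgraph(triples, seeds, k):
    """Collect the triples reachable within k hops of the seed entities.

    Edges are treated as undirected for traversal (an assumption of this sketch).
    """
    # Index every triple under both of its endpoints.
    adjacency = {}
    for head, relation, tail in triples:
        adjacency.setdefault(head, []).append((head, relation, tail))
        adjacency.setdefault(tail, []).append((head, relation, tail))

    depth = {seed: 0 for seed in seeds}  # hop distance from the nearest seed
    queue = deque(seeds)
    seen, selected = set(), []

    while queue:
        node = queue.popleft()
        if depth[node] == k:
            continue  # do not expand past the hop limit
        for triple in adjacency.get(node, []):
            if triple not in seen:
                seen.add(triple)
                selected.append(triple)
            head, _, tail = triple
            neighbour = tail if node == head else head
            if neighbour not in depth:
                depth[neighbour] = depth[node] + 1
                queue.append(neighbour)
    return selected


# Toy graph: A -> B -> C -> D
toy = [("A", "r1", "B"), ("B", "r2", "C"), ("C", "r3", "D")]
print(k_hop_subgraph(toy, ["A"], 2))
# [('A', 'r1', 'B'), ('B', 'r2', 'C')]
```

The returned triples are what would be handed to the LLM as grounded context, with each triple traceable back to its source for citation.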

GraphRAG Studio

The full-stack app built on GraphRAG Agent: upload documents, watch a typed knowledge graph build live, then chat over it with k-hop subgraph retrieval and cited answers. Next.js front end with an interactive force-graph, FastAPI back end.

live graph viz · k-hop retrieval · cited chat

Next.js · TypeScript · React · FastAPI · GraphRAG

Multi-Agent Research Pipeline

A 4-agent system (Planner → Researcher → Writer → Critic) built on LangGraph state machines that turns a question into a sourced report, fully automated end to end.

LangGraph · strict role boundaries · CI/CD

LangGraph · CrewAI · Pydantic · FastAPI · Docker

RAG Evaluation System

A hybrid-retrieval RAG pipeline (BM25 + dense + Reciprocal Rank Fusion) with an automated RAGAs/MLflow evaluation harness and regression alerts on retrieval quality.

0.94 hit@5 · 0.96 citation presence

Qdrant · RAGAs · MLflow · LangChain · FastAPI
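
As a rough illustration of the fusion step, here is a minimal sketch of Reciprocal Rank Fusion, not the project's actual code. It assumes each retriever returns an ordered list of document IDs; the function name, the document IDs and the constant `k=60` (a commonly used default) are assumptions of this sketch.

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of document IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with ranks starting at 1.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID so the order is deterministic.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a BM25 retriever and a dense retriever.
bm25_hits = ["d1", "d2", "d3"]
dense_hits = ["d3", "d1", "d4"]
print(reciprocal_rank_fusion([bm25_hits, dense_hits]))
# ['d1', 'd3', 'd2', 'd4']
```

Because only ranks are used, the BM25 and dense scores never need to be normalized against each other, which is the main reason to pick this fusion method for hybrid retrieval.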

LLMOps Observability Dashboard

A self-hosted dashboard that traces every model call's latency, token counts and per-model cost across GPT-4o, Claude, Gemini and DeepSeek — no external tracing service.

full-stack · multi-stage Docker · 12 tests

FastAPI · React · TypeScript · PostgreSQL · Docker

Multilingual News NLP Pipeline

An end-to-end German news pipeline: Whisper ASR, cross-lingual NER, fine-tuned event classification, translation and summarization — engineered to run on a single 4 GB GPU.

+13% F1 · 8.4× faster inference

Whisper · XLM-RoBERTa · PyTorch · MLflow

LLM Fine-Tuning — JD Extractor

Fine-tuned Qwen2-0.5B with QLoRA (4-bit) to extract structured JSON from job descriptions, training ~0.44% of parameters — reliable structured output from messy text.

100% JSON validity · <4 min on 4 GB GPU

QLoRA · Qwen2 · PEFT · PyTorch

Resume Tailor

A CLI tool I use daily that reads a job description, tailors a résumé and cover letter, and runs its own ATS and regression checks before producing the PDF.

multi-stage LLM pipeline · self-checking

Python · LLM agents · LaTeX · CLI

04 Skills // toolkit

AI & Agents

LangGraph · LangChain · CrewAI · RAG · Agentic AI · Prompt Engineering · Structured Outputs · MCP · Tool Use

LLMOps & Evaluation

RAGAs · MLflow · Langfuse · LLM-as-Judge · Hybrid Retrieval · Qdrant · ChromaDB · pgvector

Programming & Backend

Python · Go · FastAPI · REST / gRPC · PyTorch · React · TypeScript · SQL / PostgreSQL

Infrastructure & DevOps

Docker · Kubernetes · GitHub Actions · GitLab CI/CD · MQTT · Azure · AWS · Linux

05 Education // academic

M.Sc. Artificial Intelligence

Brandenburg University of Technology · Cottbus

Oct 2022 — 2026 (thesis phase)

Focus: Machine Learning, Computer Vision, Explainable ML. Thesis on content-aware Vision Transformer optimization for efficient inference on edge devices (PyTorch).

B.Sc. Computer Application

BVM Holy Cross College · Kottayam, India

Jul 2018 — Mar 2021

Foundations in software development, data structures, databases and systems.

06 Languages // spoken

English: Fluent · C1

German: B1 · improving

Malayalam: Native

07 Contact // get in touch

Open to AI / ML / LLM Engineer roles across Germany — remote, hybrid or on-site, and happy to relocate. The fastest way to reach me is email.

Email: aravindpradeep001@gmail.com
LinkedIn: aravind-pradeepmadathinal
GitHub: github.com/axon011
Résumé: Download PDF