Welcome to Koenig Data Solutions - Koenig Data Solutions

Expertise

Philosophy

Scientific Rigor

I bring the methodology of leading scientific research to industrial data. I deliver models that don't just "score well" but are validated, reproducible and plausible.

Clean Engineering

Sustainable software is built on solid engineering principles. I prioritize modern technology, professional design patterns and production-ready packaging in both Python and R.

Decision Intelligence

Data is only valuable if it leads to action. I build the interactive bridges – Dashboards, APIs, and Tools – that translate complex raw output into clear, actionable business logic.

Skills

$ ./list_skills.sh

Programming Languages:

[Python] [SQL] [R] [JavaScript] [Shell scripting]

Machine Learning & AI:

[Regression] [Classification] [Hyperparameter tuning] [Model Evaluation] [Agentic workflows] [Anomaly Detection] [XGBoost] [PyTorch] [scikit-learn] [caret]

Web & APIs:

[HTML] [CSS] [async/await] [FastAPI] [Flask] [Plotly Dash] [R Shiny]

Data Engineering:

[PostgreSQL] [MySQL] [DuckDB] [Neo4j] [pandas] [tidyverse] [PowerQuery] [Apache Airflow]

Visualization & Reporting:

[Plotly] [R Markdown] [Quarto] [ggplot2] [matplotlib]

Tooling & Infrastructure:

[Linux] [Git] [Docker] [AWS] [On-premises hosting] [CI/CD] [Camunda] [Jira]

Education & Background

2008 – 2012

B.Sc.

HTW Dresden

Environmental Monitoring

2012 – 2014

M.Sc.

University of Göttingen

Biodiversity, Ecology & Evolution

2015 – 2019

Dr. rer. nat.

University of Göttingen

Biodiversity, Ecology & Evolution

2019 – 2021

Postdoctoral Researcher

HU Berlin • University of Potsdam

Research and teaching

2021 – Present

Freelance Developer

König Data Solutions

Data Science and Software Engineering

Services

Machine Learning

Development of predictive models and classification systems
Design and implementation of spatial and time-series models
Model evaluation, validation, and monitoring

Generative AI

RAG-powered AI assistants and NLP integration
Vector databases and semantic search
Agentic workflows and automated tool integration

Data Engineering

Architecture design for transactional (OLTP) and analytical (OLAP) systems
Graph-based modeling and knowledge graphs
Performance-focused ELT/ETL pipeline development

Reporting & Analytics

Production-ready interactive web applications
User-centric internal tools for data exploration
Reproducible, automated reporting and document generation

Infrastructure

Setup of containerized environments with Docker and Docker Compose
Automated CI/CD pipelines for seamless app delivery
Linux server configuration and secure on-prem setups

Research Support

Implementation of standards for computational performance and reproducibility
Internal R and Python package development
Hardening scientific prototypes for production environments

Portfolio

Dec 2025 – Present

Environmental Monitoring Platform | LFB

Data quality assurance • Infrastructure

Optimization of data pipelines to improve data quality and availability. Implementation of algorithms for anomaly detection. Data storage and provisioning with PostgreSQL and Supabase.

PostgreSQL • Python • R • Supabase • Anomaly Detection • Data Ingestion

May 2024 – Present

Data Intelligence Platform | KWS Saat

Biotechnology • Full-Stack Development

Development of a data analysis and visualization platform for plant breeding using FastAPI, Neo4j, and Plotly Dash. Interactive network visualizations with Cytoscape. Integrated generative AI models to support data-driven breeding decisions.

Python • FastAPI • Neo4j • Plotly Dash • Cytoscape • LangChain • AWS • Docker

Jun 2024 – Sep 2025

Technical Infrastructure for Biodiversity research | iDiv

Research • Data Science

Built an integrated data platform for heterogeneous ecological data types. Designed a DuckDB-based storage architecture for efficient spatial operations and developed automated R-package pipelines for data ingestion. Implemented advanced multi-species modeling using Deep Learning and ensemble predictions.

R • DuckDB • torch • caret • Ensemble Modeling • targets • Spatial Data

Jun 2023 – Oct 2024

Process automation | Offerista Group

Digital Marketing • Data Automation

Optimized and automated localized digital marketing workflows. Restructured complex MS Excel/PowerQuery data architectures to improve maintainability and performance. Implemented automated data validation and cleaning, with direct integration of external resources via Looker API.

MS Excel • PowerQuery • Looker API • VBA

Aug 2022 – Feb 2024

Reporting Infrastructure | European Central Bank

Finance • Enterprise Systems

Migrated the technical infrastructure underlying the ECB's financial reporting system. Developed a custom R-package ecosystem and a collaborative R-Shiny interface for automated report generation. Integrated with Oracle DB, Camunda workflows, and enterprise document management systems in an agile Scrum environment.

R • R Shiny • Python • Camunda • OracleDB • GitLab API • Timeseries Data

Nov 2021 – Jul 2022

Interactive Data Standardization Tool | KIT

Research • Web Application

Developed an R-Shiny web application for researchers to standardize and share vegetation data. Implemented robust XML processing pipelines for data validation and schema compliance. Optimized system performance and managed deployment through Dockerized server environments.

R • R Shiny • XML • Docker • Scientific Data

Christian König