Freelance Data Scientist & Software Engineer.
I bring the methodology of leading scientific research to industrial data. I deliver models that don't just "score well" but are validated, reproducible and plausible.
Sustainable software is built on solid engineering principles. I prioritize modern technology, professional design patterns and production-ready packaging in both Python and R.
Data is only valuable if it leads to action. I build the interactive bridges – Dashboards, APIs, and Tools – that translate complex raw output into clear, actionable business logic.
Optimization of data pipelines to improve data quality and availability. Implementation of algorithms for anomaly detection. Data storage and provisioning with PostgreSQL and Supabase.
Development of a data analysis and visualization platform for plant breeding using FastAPI, Neo4j, and Plotly Dash. Interactive network visualizations with Cytoscape. Integrated generative AI models to support data-driven breeding decisions.
Built an integrated data platform for heterogeneous ecological data types. Designed a DuckDB-based storage architecture for efficient spatial operations and developed automated R-package pipelines for data ingestion. Implemented advanced multi-species modeling using Deep Learning and ensemble predictions.
Optimized and automated localized digital marketing workflows. Restructured complex MS Excel/PowerQuery data architectures to improve maintainability and performance. Implemented automated data validation and cleaning, with direct integration of external resources via Looker API.
Migrated the technical infrastructure underlying the ECB's financial reporting system. Developed a custom R-package ecosystem and a collaborative R-Shiny interface for automated report generation. Integrated with Oracle DB, Camunda workflows, and enterprise document management systems in an agile Scrum environment.
Developed an R-Shiny web application for researchers to standardize and share vegetation data. Implemented robust XML processing pipelines for data validation and schema compliance. Optimized system performance and managed deployment through Dockerized server environments.
Ready to turn your data challenges into competitive advantages? Let's discuss!