Home - Stefan Kasberger

My name is Stefan Kasberger and I love developing applications for the common good.

In my professional life, I am a Data Scientist and DevOps Engineer. I build data pipelines and am keen on automatisation, standardization and quality assurance in software development. My approach is built upon using and creating open knowledge, such as open source and open data, to create innovative software eco-systems.

Besides that, I love to hang out with my family (Hannah, Moritz and Luise) and friends, do sports or play games. So it never gets boring. :)

Living in Vienna, Austria

Working as "Technical Consultant Big & Open Data" at the Austrian Federal Computing Center (BRZ) + Freelance Consulting

Studying "Environmental Systems Science - Geography" at the University of Graz

Volunteering: Founder of openscienceASAP, Open Science Working Group AT and Offene Wahlen AT and former chairman of Open Knowledge AT

Interests: Basketball, Football, Mountaineering, Music and Games

pyDataverse

I developed an Open Source Python module for the Dataverse community. It helps to access the Dataverse API's and use its data-types.

Dataverse (CTS & OAIS)

At the University of Vienna I operated the AUSSDA Dataverse. This included all related processes, such as OAIS alignment, the CoreTrustSeal certification and the technical strategy.

Text and Data Mining

During the Zika epidemic, I applied Text and Data Mining to extract information from thousands of academic papers to understand the virus better.

Reachability Analysis

During my study in Graz, I conducted a reachability analysis with pgRouting, QGIS, Docker and OpenStreetMap data to get hands on the full geo data pipeline.

Creating Smart Data Pipelines

Extract

Process

Analyze

Use

Developing

I use Python to create web-applications, installable packages, web-scrapers or scientific scripts. My focus is on Open Source and Open Standards, with emphasis on sustainable software development practices, such as testing, documenting and linting.

Tools

Python (Flask, FastApi, Pytest, Sphinx), Git, VSC, JavaScript, HTML, CSS

Paradigms

Open Source, OOP, Test Driven Development, Git Flow, RESTful APIs, OpenAPI

Operating

Modern web-applications are built not only with software infrastructure supporting, but also with processes surrounding it. I operate and design CI/CD pipelines & processes around applications such as Dataverse with an agile approach..

Processes

CI/CD, Monitoring, Web-Analytics, Preservation, Strategy Management

Tools

Docker, Dataverse, postgreSQL, AWS, Web-Server, Jenkins, Matomo, Selenium, Shell, Linux

Paradigms

CoreTrustSeal, OAIS, Agile, OAI-PMH, DDI

Using

Value is often created at the end of the data pipeline, when the prepared data is finally used for its intended purpose. This can be a simple visualization, a complex analytical pipeline or a service on its own.

Scientific Methods

Machine Learning, Network Analysis, Spatial Analysis, Text Data Mining, Agent-based Modelling

Scientific Tools

Python, QGIS, GRASS GIS, R, Matlab

About

Portfolio

Services

Developing

Tools

Paradigms

Operating

Processes

Tools

Paradigms

Using

Scientific Methods

Scientific Tools

References