Services
Services
TECHNOLOGY SERVICES
BUSINESS CASES
Experience
About us
Insights
INSIGHTS
Blog
News
Webinars
Careers
Whistleblower
Let's talk
EN
PL
DE
Polski
English
Deutsch
Blog
Michal Milosz
Latest blog posts by Michal Milosz
Latest blog posts
View all
AI Sales
AI Security
Generative AI
Databricks
CSR
MLOps
Google Cloud Platform
Data Migration
Data Analysis
Data Engineering
DevOps
Quantum Computing
Augmented Reality
Internet of Things
Blockchain Technology
Cyber Security
Cloud Computing
Data Science
Artificial Intelligence
Machine Learning
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
No blog posts found.
Data Engineering
9
min read
The "Shift Left" Revolution: Why Your PySpark Pipelines Need Unit Tests (and How to Do It)
Learn how adopting a “shift left” mindset with unit-tested, modular PySpark code, local testing, and CI/CD automation can cut cloud costs, prevent production bugs, and make your data pipelines behave like real engineered software.
Read more
Data Engineering
6
min read
The Orchestration Dilemma: Declarative vs. Imperative Patterns in Modern Data Engineering
Learn how to choose between declarative Delta Live Tables and imperative Databricks Workflows to design scalable, cost‑efficient data architectures that match your team’s culture and operational needs in the modern Azure and Databricks ecosystem.
Read more
Data Engineering
8
min read
Streamlining CI/CD for Databricks Workflows with DAB Templates and Azure DevOps/GitHub Actions
Automating Databricks deployments with DAB templates and CI/CD tools ensures fast, error-free pipelines - read to learn how.
Read more
Data Engineering
6
min read
Unity Catalog and Volumes: A Data Engineer's Perspective on Modern Data Governance in Databricks
Discover how Unity Catalog and Volumes in Databricks revolutionize data governance by enabling secure, centralized management and effortless discovery of both structured and unstructured data across your Lakehouse.
Read more
Data Engineering
16
min read
Designing Scalable Data Pipelines: Batch, Streaming, and Layered Architectures
Discover the strengths and uses of three key data architectures - find out how they work and which one fits your needs best
Read more
Data Engineering
10
min read
The Power of Orchestration: Managing Complex Workflows in Databricks
Master Databricks workflow orchestration – automate tasks, integrate Airflow & ADF, and optimize pipelines. Boost efficiency with dynamic scheduling!
Read more
DevOps
11
min read
Azure Data Factory or Apache Airflow: Which Orchestration Tool Reigns Supreme?
Compare Azure Data Factory vs Apache Airflow: key features, strengths, and limitations for data orchestration. Choose the best tool for your workflow needs.
Read more
Databricks
9
min read
Managing Large Data Sets in Databricks
Optimize large datasets in Databricks with Partitioning, Z-Ordering, Auto Optimize, Delta Lake Vacuum, Caching, and Cost Monitoring for better performance.
Read more
Data Engineering
12
min read
Configuring the Celery Kubernetes
Configure the Celery Kubernetes Executor in Airflow 2.0 to combine Celery's scalability with Kubernetes' resource optimization for efficient workflows.
Read more
Data Engineering
7
min read
The Future of Data Engineering – Trends to Watch in 2025
Discover the top data engineering trends for 2025, including AI-driven automation, Lakehouse architecture, serverless and edge computing, and sustainable data practices. Embracing these innovations will help organisations optimise data operations and stay ahead in a rapidly changing landscape.
Read more
1
Next