Automating Wind Turbine Data Cleaning with Machine Learning
August 26, 2025
2 min

Automating Wind Turbine Data Cleaning with Machine Learning

Discover how Delfos automates wind turbine data cleaning using Gaussian Process to boost performance and eliminate manual filtering.

Data cleaning may seem like an invisible task in the renewable energy world, but that's precisely where efficiency begins. In this study by Delfos Energy, researchers demonstrate how applying machine learning to wind turbine power curve modeling can eliminate manual work, making the process far more scalable and accurate.

The Problem: Dirty Data and Manual Inefficiency

To understand a wind turbine’s actual performance, one must analyze its power curve—the relationship between wind speed and energy production. However, data collected by SCADA systems often comes with noise, operational failures, technical limitations, or even sensor errors. And event logs are not always available or reliable.

In practice, this forces many teams to manually filter the data to remove anomalies before modeling the power curve. As you can imagine, this process is slow, subjective, and not scalable.

The Innovation: Gaussian Process as a Smart Filter

The study’s proposal is simple yet powerful: use Gaussian Process (GP), a Bayesian machine learning algorithm (a probabilistic classifier based on Bayes' Theorem), to intelligently automate this data cleaning process.

Instead of relying on manual logs or event records, GP is trained to identify normal operational patterns of the turbine. It learns from the data itself—detecting outliers and removing anomalies with high precision.

What’s more, it does all this without distorting the final outcome, as proven by comparing AEP (Annual Energy Production) values across different filtering methods.

The Result: Efficiency and Scalability

Two experiments were conducted:

  • In the first, the authors showed that the Gaussian Process-based filter produced AEP results almost identical to traditional manual filtering—with only a 0.29% difference.

Figure 1: Comparison of the adjusted power curves obtained with manual filtering and Gaussian filtering

    • In the second, GP was compared to a filter based on severity logs (when available), and once again the result was very close, with only a -0.25% difference.
Figure 2: Comparison of the adjusted power curves obtained with gravity filtering and Gaussian filtering

In both cases, Delfos’ approach proved that it is possible to automate data cleaning without compromising the final model’s accuracy. This marks a major leap forward for teams aiming for standardized and agile performance analysis.

Why Does This Matter?

This innovation reinforces Delfos’ commitment to technical excellence and the automation of critical processes in the renewable energy sector. By eliminating manual and subjective steps, we pave the way for more robust, reliable, and replicable analysis—essential ingredients for the global energy transition.

For our clients and partners, this translates into more confident decision-making, reduced rework, and a smarter platform every day.

Want to see the full technical paper?

Fill in the form below and access the full paper published by our R&D team and understand all the technical details behind this innovation.

Study published by Osmar Alexandre, Rodrigo Queiroz and Gustavo Carvalho from Delfos Energy and submitted to Brazil Windpower 2024.

Book a meeting

Let's connect and forge new partnerships

contact us

Custom Renewable Energy Solutions

Contact us today to discuss your renewable energy needs and find the perfect solution for your business.

By clicking “Accept all cookies”, you agree to the storing of cookies on your device to enhance site navigation, analyze site usage, and assist in our marketing efforts. View our Privacy Policy for more information.