← back
Mapa digital de cidade brasileira com redes neurais e dados
Machine Learning + Epidemiology

Forecast outbreaks before
the first symptom

We cross climate and epidemiological data through Machine Learning to forecast dengue outbreaks, offering public health managers precise insights to direct interventions and health policies more efficiently.

10 Years of data
4.2M Global cases (2019)
5 PCA components
73.5% Model accuracy
Scroll

A constant and complex threat

Since 1981, dengue has established itself as endemic in Brazil. The complexity of factors makes control and prediction extremely challenging.

Mosquito Aedes aegypti em ambiente urbano brasileiro

Multiple variants

The serotypes DENV-1 through DENV-4 circulate across the country. Infection by one type does not provide immunity against the others, keeping the population vulnerable.

Severe seasonal impact

In 2019, a new DENV-2 variant caused an increase of up to 149% in cases in some Brazilian states.

Favorable conditions for the vector

The tropical climate provides ideal conditions for Aedes aegypti reproduction, intensified by improper waste disposal and lack of sanitation.

Democratizing forecasting with artificial intelligence

We apply the most modern in Machine Learning. Recent studies already demonstrate that crossing environmental data can forecast dengue outbreaks.

01

Advanced automation

We incorporate tools like AutoML and TPOT, which uses genetic programming to explore thousands of possibilities and find the ideal forecasting configuration.

02

Accessibility

Beyond accelerating predictive model production, our approach aims to democratize AI use, making Machine Learning accessible to a broader audience.

03

Focus on results

Central objective: to offer a faster, optimized and efficient method to forecast dengue outbreaks before they occur.

Vista aérea de cidade brasileira com rede de dados sobreposta
The science of data

The forecasting engine: crossing climate and public health

Our project doesn't work with guesswork; it's grounded in a decade of rigorously processed historical data.

Epidemiological

DATASUS

Complete history of dengue cases in Presidente Prudente (SP) between 2014 and 2024, analyzed week by week through epidemiological weeks.

Climate

INMET

Hourly meteorological data from the National Institute of Meteorology for the same period of 10 years: temperature, humidity, precipitation and atmospheric pressure.

Climate data was carefully treated to fill gaps and adjusted to weekly averages to perfectly synchronize with dengue medical records.

All results presented were obtained exclusively from publicly available data from conventional INMET weather stations — with no proprietary stations deployed in the field. This means the 73.5% accuracy represents a conservative baseline: with dedicated local sensors capable of capturing microclimates and hyperlocal variations, the model's predictive performance is expected to improve significantly.

Finding the needle in the haystack

With thousands of data points, we use Principal Component Analysis to reduce complexity and reveal the variables with the greatest impact.

PC1

The weight of the long term

Dew point temperature and accumulated rainfall over several weeks (4-8 weeks) are the strongest factors. The climate from weeks ago dictates today's outbreak.

PC2

The thermal shock

Marked fluctuations in temperature and atmospheric pressure dictate the daily and seasonal rhythm of the vector.

PC3

Extreme events

Abrupt variations (intense droughts or sudden torrential rains) function as rapid triggers for the disease.

PC4

Wind and humidity

Nuances in wind behavior and relative humidity complete the puzzle of Aedes aegypti proliferation.

PC5

Urban ventilation

Average wind speed and maximum gusts reveal how ventilation affects mosquito dynamics.

From data to targeted actions

The true value of data lies in how it is used. Our tool transforms raw records into strategic intelligence.

Targeted interventions

More precise and timely insights for public health authorities, enabling action before the outbreak spreads.

Resource optimization

Awareness campaigns and control measures applied in the right locations and times, before the outbreak occurs.

Interdisciplinary collaboration

Integration between epidemiology, climatology and data science to mitigate the impact of vector-borne diseases.

Gestora de saúde pública analisando dashboard de risco de dengue

Dengue is the pilot. The model is replicable.

Any disease whose incidence varies with climate can be forecast using the same approach. The engine is generic; the data changes. For the pharmaceutical industry, this means anticipating demand weeks in advance.

Arboviruses

Dengue, zika, chikungunya, urban yellow fever. Same vector, same climate logic.

Respiratory diseases

Flu, bronchitis, seasonal asthma, pneumonia. Winter peaks predictable by climate patterns.

Seasonal allergies

Rhinitis, conjunctivitis, atopic dermatitis. Humidity and temperature dictate demand for antihistamines.

Prisma de vidro refratando luz caótica em cinco feixes verdes distintos

Whoever knows where the next peak will happen prevents better, distributes better, stocks better, sells better.