Harnessing Multiple Data Streams and Artificial Intelligence to Better Predict Flu

Motivation for developing spatio-temporal network and ensemble approaches. a Heatmap of pairwise %ILI correlations between all states in the study over the period September 30, 2012 to May 14, 2017. Five clusters of intercorrelated states are denoted by black boxes. b Geographic distribution of the five identified clusters. c RMSE improvement of Net over ARGO over the period September 28, 2014 to May 14, 2017. The improvement of Net is here defined as the inverse RMSE ratio of Net and ARGO, so values above 1 indicate improvement. d RMSE improvement of ARGONet over ARGO over the same period

Influenza is highly contagious and easily spreads as people move about and travel, making tracking and forecasting flu activity a challenge. While the CDC continuously monitors patient visits for flu-like illness in the U.S., this information can lag up to two weeks behind real time. A new study, led by the Computational Health Informatics Program (CHIP) at Boston Children’s Hospital, combines two forecasting methods with machine learning (artificial intelligence) to estimate local flu activity. Results are published in Nature Communications.

When the approach, called ARGONet, was applied to flu seasons from September 2014 to May 2017, it made more accurate predictions than the team’s earlier high-performing forecasting approach, ARGO, in more than 75 percent of the states studied. This suggests that ARGONet produces the most accurate estimates of influenza activity available to date, a week ahead of traditional healthcare-based reports, at the state level across the U.S.

“Timely and reliable methodologies for tracking influenza activity across locations can help public health officials mitigate epidemic outbreaks and may improve communication with the public to raise awareness of potential risks,” says Mauricio Santillana, PhD, a CHIP faculty member and the paper’ senior author.

Learning about localized flu patterns

The ARGONet approach uses machine learning and two robust flu detection models. The first model, ARGO (AutoRegression with General Online information), leverages information from electronic health records, flu-related Google searches and historical flu activity in a given location. In the study, ARGO alone outperformed Google Flu Trends, the previous forecasting system that operated from 2008 to 2015.

To improve accuracy, ARGONet adds a second model, which draws on spatial-temporal patterns of flu spread in neighboring areas. “It exploits the fact that the presence of flu in nearby locations may increase the risk of experiencing a disease outbreak at a given location,” explains Santillana, who is also an assistant professor at Harvard Medical School.

The machine learning system was “trained” by feeding it flu predictions from both models as well as actual flu data, helping to reduce errors in the predictions. “The system continuously evaluates the predictive power of each independent method and recalibrates how this information should be used to produce improved flu estimates,” says Santillana.

Precision public health

The investigators believe their approach will set a foundation for “precision public health” in infectious diseases.

“We think our models will become more accurate over time as more online search volumes are collected and as more healthcare providers incorporate cloud-based electronic health records,” says Fred Lu, a CHIP investigator and first author on the paper.

Harnessing Multiple Data Streams and Artificial Intelligence to Better Predict Flu

Harnessing Multiple Data Streams and Artificial Intelligence to Better Predict Flu

Leave a Reply

Columbia Engineering Announces New Program: Master of Science in Artificial Intelligence

Flexible Governance for Biological Data Is Needed to Reduce AI’s Biosecurity Risks

AI Agents Debate More Effectively When Given Personalities and the Ability to Interrupt

Africa CDC Establishes Central Data Repository to Strengthen Public Health Surveillance

70 Percent of Decision Makers Are Losing Sleep Over Critical Data Security Concerns

Overtones Can Provide Faster Data Communication

Racial Inequality in the Deployment of Rooftop Solar Energy in the U.S.

Leave a Reply