People who lose their jobs often post about it before they file paperwork. They vent, ask connections for leads, or just announce they’re unemployed. These posts, scattered across millions of social media users, turn out to predict official unemployment claims up to two weeks before government data arrives.
Researchers from New York University, the World Bank, and the University of Oxford built an AI model that scans Twitter for job loss disclosures. They trained it on 31.5 million U.S. users posting between 2020 and 2022. The system, called JoblessBERT, catches informal language that keyword searches miss—phrases like “needa job” or “neeeeeed a job!” It identified nearly three times more unemployment posts than earlier methods while staying accurate.
The model doesn’t track sentiment or opinions about the economy. It flags direct statements: I lost my job. I’m unemployed. I need work.
Fixing the Twitter-Isn’t-Everyone Problem
Twitter users skew younger than the general population. Not everyone who loses a job posts about it. The researchers inferred each user’s age, gender, and location, then adjusted their counts to match U.S. Census proportions. Post-stratification, they call it. With those corrections, they could forecast unemployment insurance claims at national, state, and city levels.
The AI used active learning, improving by focusing on ambiguous cases where a post might or might not indicate job loss. Over time, it captured a broader cross-section of users across demographics and regions.
In March 2020, unemployment claims exploded from 278,000 to nearly 6 million in two weeks as COVID-19 shut down the economy. Government systems couldn’t process the surge fast enough. Two days before the official reporting week ended, JoblessBERT predicted 2.66 million claims. The actual number: 2.9 million. Industry forecasters using traditional methods badly underestimated the spike.
“We show that our methodology consistently outperforms the industry’s forecasting average and can improve the predictions of U.S. unemployment insurance claims, up to two weeks in advance,” Samuel P. Fraiberger of the World Bank Development Impact Group explains.
Across the full study period, forecasts using the social media signal cut prediction errors by more than 50 percent compared with industry consensus.
Early Warnings During Economic Chaos
The advantage shows up most during rapid shifts. When conditions change faster than data collection cycles allow, two weeks of lead time matters. Policymakers can mobilize resources, adjust programs, or prepare communications while traditional statistics are still being compiled.
In ten cities where official data updates rarely or not at all, the model still produced reliable estimates. The digital signal holds up even without local government benchmarks to validate against.
The authors don’t suggest replacing official labor statistics. Traditional surveys remain comprehensive and methodologically rigorous. But they provide a real-time supplement. What people are experiencing right now, not what they reported in last month’s survey.
The constraint is data access. Social platforms have restricted researcher access to public posts in recent years, making this kind of work harder to sustain. If researchers can use anonymized data responsibly while protecting privacy, these signals become tools for public interest research. Right now, the next wave of job losses might appear in social feeds long before it reaches official spreadsheets. Whether anyone can still see it is another question.
PNAS Nexus: 10.1093/pnasnexus/pgaf309
ScienceBlog.com has no paywalls, no sponsored content, and no agenda beyond getting the science right. Every story here is written to inform, not to impress an advertiser or push a point of view.
Good science journalism takes time — reading the papers, checking the claims, finding researchers who can put findings in context. We do that work because we think it matters.
If you find this site useful, consider supporting it with a donation. Even a few dollars a month helps keep the coverage independent and free for everyone.