Classifying articles via URLs

Nick Hagar
6 min readFeb 14, 2024

Read it first on my Substack

Today, I want to work through a small exercise in language modeling. A common NLP task for researchers working with news articles is some form of topic modeling. By getting a sense of what news stories are about, aggregated into a defined vocabulary of labels, researchers can make comparisons across groups (e.g., politics news versus financial news, or hard versus soft coverage). The…

--

--

Nick Hagar

PhD student @ Northwestern University. I worked in digital media, now I study it.