An end-to-end NLP pipeline — dynamic topic models combined with a novel document-influence metric — applied to ~1M articles spanning ten years of local newspaper data to measure investigative-journalism content. A measurement problem on noisy, high-dimensional text. Dataset and code released on Harvard Dataverse.