This guide is intended to help you learn about text and data mining (TDM) and the resources available to you here at UF via the George A. Smathers Libraries. Before you get started, please:
Violating license agreements, even unintentionally, can cause the entire campus community to lose access to critical research resources and may expose you and the University to legal liability.
Text and data mining (TDM) are automated techniques for analyzing large volumes of digital information to discover patterns, trends, and valuable insights. Data mining is the overarching process of finding anomalies, patterns, and correlations within large datasets, combining techniques from machine learning, statistics, and database systems. It evaluates both structured data (like database tables) and unstructured data (like text) to identify new information. Text mining is a specialized subset of data mining that focuses specifically on unstructured text-based data (like interviews, articles, or narratives). The overall goal of TDM is to transform raw data into knowledge.
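To make the idea of turning unstructured text into structured information concrete, here is a minimal sketch of one of the simplest text mining steps: counting term frequencies. The sample documents, the stopword list, and the function names `tokenize` and `term_frequencies` are all hypothetical and invented for illustration. Real projects typically use larger corpora and dedicated libraries, and should only use content that the relevant license permits you to mine.

```python
import re
from collections import Counter

# Hypothetical sample documents, invented for illustration only.
documents = [
    "Libraries license databases for research.",
    "Researchers mine licensed databases for patterns.",
    "Text mining finds patterns in unstructured text.",
]

# A tiny, illustrative stopword list (an assumption, not a standard list).
STOPWORDS = {"for", "in", "the", "a", "of"}


def tokenize(text):
    """Lowercase the text, extract alphabetic tokens, and drop stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def term_frequencies(docs):
    """Count how often each token appears across all documents."""
    counts = Counter()
    for doc in docs:
        counts.update(tokenize(doc))
    return counts


freqs = term_frequencies(documents)

# Show the most frequent terms: unstructured text has become a structured table of counts.
for term, count in freqs.most_common(3):
    print(term, count)
```

The resulting counts are structured data that could then feed the broader data mining techniques described above, such as statistical analysis or machine learning.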