Guides @ UF: Text and Data Mining: Home

Overview

This guide is intended to help you learn about text and data mining (TDM) and the resources available to you here at UF via the George A. Smathers Libraries. Before you get started, please:

Consult the general terms and conditions for using UF Library electronic resources
- UF Libraries Terms of Use: https://uflib.ufl.edu/about/user-policies/terms-of-use/
- If using an Open Access or Public Access dataset, review their terms as well.
Review the specific terms of use on our UF Licensed Data Sources page, or email er-help@uflib.ufl.edu if you plan on using any AI tools for text and data mining, as separate restrictions may apply.

Violating license agreements, even unintentionally, can result in the entire campus community losing access to critical research resources and potentially expose you and the University to legal liability.

What is text and data mining?

Text and data mining (TDM) are automated techniques for analyzing large volumes of digital information to discover patterns, trends, and valuable insights. Data mining is the overarching process of finding anomalies, patterns, and correlations within large datasets, combining techniques from machine learning, statistics, and database systems. It evaluates both structured data (like database tables) and unstructured data (like text) to identify new information. Text mining is a specialized subset of data mining that focuses specifically on unstructured text-based data (like interviews, articles, or narratives). The overall goal of TDM is to transform raw data into knowledge.

News and Updates

Copyright Act in Australia Won’t Permit Free use of Copyright Works in AI
The National Law Review, October 20, 2025
Judge skewers $1.5B Anthropic settlement with authors in pirated books case over AI training
AP News, September 8, 2025
Librarian of Congress Expands DMCA Exemption for Text and Data Mining
Association of Research Libraries, October 28, 2024
Exemption to Prohibition on Circumvention of Copyright Protection Systems for Access Control Technologies
Copyright Office, Library of Congress October 28, 2024
Academic authors 'shocked' after Taylor & Francis sells access to their research to Microsoft AI
The Bookseller, July 19, 2024
EU Artificial Intelligence Act
Official Journal (OJ) of the European Union, July 12, 2024

Text and Data Mining

Basics

Text and Data Mining Library Guides

Overview

What is text and data mining?

News and Updates