A dataset (or data set) is a collection of data. Data is any item of information, usually numerical, that is not yet subject to interpretation. A dataset is essentially a list of numbers or other bits of information that can be used in statistical analysis.
"Big Data" is a term that describes an extremely large dataset. Computational manipulation (using computers to handle data) is usually required to make sense of big datasets.
Statistics is the practice of using data in a variety of ways: statistics collects, classifies, arranges, manipulates, and interprets datasets. If you see a report with charts and graphs, you will typically see an accompanying written analysis, or qualitative data analysis, that explains the data.
Some of the websites on this guide will lead you to raw data. Others will offer reports that interpret this data. Ask a Librarian if you are unsure of how to get what you need, and read a helpful book on how to use statistics in your research.
Statistics can be complicated and complex, and data can be manipulated in many ways to varying ends. It is good practice to study the methodologies of surveys and data collection in order to understand the results given by the researcher or institutional body.
You can cite datasets and statistics in any formatting style:
APA (OWL Purdue) - Used in the Social Sciences
example:
Citing Data Sets
Lastname, F. M. or Name of Group (Year). Title of dataset (Version No.) [Data set]. Publisher. DOI or URL
Grantmakers in the Arts. (2019). Arts funding trends, United States, 1994-present (ICPSR 37337) [Data set]. National Archive of Data on Arts & Culture. https://www.icpsr.umich.edu/icpsrweb/NADAC/studies/37337
MLA (OWL Purdue) - Used in the Humanities
Because MLA style is most often used in the humanities, it is unlikely that you will include raw scientific data in an MLA-style paper, but you may be asked to include other kinds of research in your writing. See citing Tables, Figures, and Examples for more information.
Rule of thumb, make sure to include as many of these elements in your citation:
If you have published an article that uses datasets, and want to or are required to openly share your data - good news! You can post your data in The University of Tampa's Institutional Repository.
Please see this guide to learn more about the IR and this guide to learn about scholarly publishing and managing datasets.
Macdonald-Kelce Library - The University of Tampa - 401 W. Kennedy Blvd. - Tampa, FL 33606 - 813 257-3056 - library@ut.edu - Accessibility