Simple English Wikipedia
Simplewiki
The dataset is composed of the content of Simple Wikipedia including articles and revision history in XML. The XML dumps are in a Export format and compressed in bzip2 and .7z formats; while SQL dumps are in mysqldump https://meta.wikimedia.org/wiki/Data_dumps
Dataset of National Endowment for the Humanities grants, 1980-1989
NEH grants dataset, 1980-1989
"Information about NEH grants is contained in the files named NEH_Grantsxxxxx.zip. These files are broken into decades. The data is described in the file NEH_GrantsDictionary.pdf. Note that Metadata for grants that antedate the NEH electronic grants management system is sparser than that for more recent grants. The XML files for grants are available in two formats: one with hierarchical XML (a grant may have…
Contributor:
National Endowment for the Humanities
Date:2019-01-01
Software, E-Resource
Transcription dataset from "the crystal: a record of visions and conferences with the in-dwellers of the spirit world” by Frederick Hockley, Harry Houdini Collection, Rare Book and Special Collection Division
By the People transcription campaign title: Seers, spiritualists, and the spirit work : the experiments of Frederick Hockley | Crystal : a record of visions and conferences with the in-dwellers of the spirit world : transcription dataset
This dataset is an export of transcriptions for 4,656 images from the "the crystal : a record of visions and conferences with the in-dwellers of the spirit world” created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign Seers, Spiritualists, and the Spirit World: The experiments of Frederick Hockley. It contains text created by volunteers through a transcription…
Contributor:
Hockley, Frederick - By the People (Program) - Harry Houdini Collection (Library of Congress)
Date:2023
Software, E-Resource
Amazing grace : [dataset]
Amazing grace collection | Chasanoff/Elozua Amazing grace collection
Collection compiled by Allan Chasanoff and Raymon Elozua. The collection highlights the history of the hymn “Amazing Grace” from the earliest printing of the song to selected performances of it on published and field recordings. These items have been collected from several divisions in the Library of Congress, including the Music Division, the American Folklife Center, the Motion Picture, Broadcasting and Recorded Sound Division,…
Contributor:
Chasanoff, Allan - Elozua, Raymon - Library of Congress
Date:2024
Software, E-Resource
Transcription datasets from the Blackwell family papers, Manuscript Division
By the People transcription campaign title : The Blackwells: an extraordinary family
This dataset is an export of transcriptions for 56,187 images from the Blackwell Family Papers digital collection created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign The Blackwells: An Extraordinary Family. It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata representing the arrangement of the items in…
Contributor:
By the People (Program)
Date:2024
Software, E-Resource
Transcription dataset from the World War II Rumor Project, American Folklife Center
By the People transcription campaign title : information and disinformation : the World War II rumor project
This dataset is an export of transcriptions for 9,189 images from the World War II Rumor Project Collection digital collection created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign Information and Disinformation: The World War II Rumor Project. It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata…
Contributor:
By the People (Program)
Date:2022
Collection
Community Memorialization Project Data pages
Website for dataset created 2015. The Community Memorialization Project began in 2015. Project by: Radhika Hettiarachchi with Search for Common Ground, Sri Lanka. The Community Memorialization Project is an archive of 320 village histories and life stories of individuals and groups, collected and archived to memorialize the experiences of violence and conflict in three Sri Lankan districts. Using the archive, the project creates opportunities…
Contributor:
Search for Common Ground (Sri Lanka) - Hettiarachchi, Radhika - Centre for Human Resource Development (Colombo, Sri Lanka) - Viluthu - Herstories Project (Sri Lanka)
Date:2015
Collection
Herstories Project Data pages
Website for dataset created July 2012. The Herstories Project --This is a list of the documentation available from Ampara, Batticaloa, Kilinochchi, Kurunegala, Moneragala, Mullaitivu, Vavuniya. Project date 2012-2013. Project by Radhika Hettiarachchi with Viluthu Centre for Human Resource Development, Sri Lanka The Herstories Archive is an oral history archive of women's life stories from Sri Lanka's civil war. This documentation initiative comprises life histories…
Contributor:
Centre for Human Resource Development (Colombo, Sri Lanka) - Viluthu - Herstories Project (Sri Lanka) - Hettiarachchi, Radhika
Date:2012
Software, E-Resource
Transcription dataset from the William A. Gladstone Afro-American military collection, Manuscript Division
By the People transcription campaign title : Brothers in arms : the Gladstone Afro-American military collection
This dataset is an export of transcriptions for 3,097 images from the William A. Gladstone Afro-American Military Collection created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign Brothers in Arms: The Gladstone Afro-American Military Collection. It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata representing the arrangement…
Contributor:
Gladstone, William A. - By the People (Program)
Date:2022
Software, E-Resource
Transcription dataset from the Mary Church Terrell Papers, Manuscript Division
By the People transcription campaign title : Mary Church Terrell : advocate for African Americans and women
This dataset is an export of transcriptions for 24,936 images from the Mary Church Terrell Papers created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign Mary Church Terrell: Advocate for African Americans and Women. It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata representing the arrangement of…
Contributor:
Terrell, Mary Church - By the People (Program)
Date:2018
Software, E-Resource
Transcription dataset from the George Washington Papers, Manuscript Division
By the People transcription campaign title : Ordinary Lives in George Washington's Papers
This dataset is an export of transcriptions for 593 images from the George Washington Papers digital collection created by volunteers participating in the Library of Congress crowdsourcing program, By the People (https://crowd.loc.gov) campaign, Ordinary Lives in George Washington's Papers. It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata representing the arrangement of the items…
Contributor:
By the People (Program) - Washington, George
Date:2023
Software, E-Resource
Transcription dataset from the Frederick Douglass papers, Manuscript Division
By the People transcription campaign title : Yours Truly, Frederick Douglass
This dataset is an export of transcriptions created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign Yours Truly, Frederick Douglass for 9,352 images from the Frederick Douglass Papers at the Library of Congress digital collection . The dataset contains volunteer-created and -reviewed text, volunteer-created tags, digital collections metadata, and metadata representing the arrangement of the items in…
Contributor:
By the People (Program) - Douglass, Frederick
Date:2025
Software, E-Resource
Transcription datasets from the Samuel J. Gibson Diary and Correspondence, Manuscript Division
By the People transcription campaign title : This Hell-upon-earth of a prison : Samuel J. Gibson's Andersonville Diary
This dataset is an export of transcriptions for 90 images from the Samuel J. Gibson Diary and Correspondence created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign, "This Hell-upon-earth of a Prison": Samuel J. Gibson's Andersonville Diary. It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata representing…
Contributor:
Gibson, Samuel J. - By the People (Program)
Date:2020
Software, E-Resource
Transcription dataset from “such eventful times”: women and the American Civil War transcription campaign, Manuscript Division
By the People transcription campaign title : “Such eventful times”: women and the American Civil War
This dataset is an export of transcriptions for 4,670 images of materials representing Mary Ann Arnold, Mary Ann Bickerdyke, Betty Herndon Maury, and Dora Stephens, contained in the John Carvel Arnold Papers, Mary Ann Bickerdyke Papers, Betty Herndon Maury Maury Papers, and Alexander Hamilton Stephens Papers digital collections, respectively. The transcriptions were created by volunteers participating in the Library of Congress crowdsourcing program By…
Contributor:
By the People (Program)
Date:2024
Software, E-Resource
Transcription dataset from the James A. Garfield papers - diaries, Manuscript Division
By the People transcription campaign title : James A. Garfield diary: "his confidential friend"
This dataset is an export of transcriptions for 2,462 images from the James A. Garfield Papers digital collection created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign James A. Garfield Diary: “His Confidential Friend.” It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata representing the arrangement of…
Contributor:
By the People (Program) - Garfield, James A. (James Abram)
Date:2024
Software, E-Resource
Transcription dataset from the George S. Patton papers - diaries, Manuscript Division
By the People transcription campaign title : War diaries of George S. Patton
This dataset is an export of transcriptions for 3,281 images from the George S. Patton Papers: Diaries digital collection created by staff transcribers through the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) staff campaign War Diaries of George S. Patton. It contains text created through a transcription and review process, tags, digital collections metadata, and metadata representing the arrangement of the items…
Contributor:
By the People (Program) - Patton, George S. (George Smith)
Date:2021
Software, E-Resource
Dataset from tribal leaders directory
Tribal leaders directory
"The Tribal Leaders Directory provides contact information for each federally recognized tribe. The electronic, map based, interactive directory also provides information about each BIA region and agency that provides services to a specific tribe. Additionally, the directory provides contact information for Indian Affairs leadership."--Directory website. Available in three formats: CSV, JSON, and XML. Archived by the Library of Congress May 2019. Description based on…
Contributor:
United States. Bureau of Indian Affairs
Date:2016-01-01
Software, E-Resource
Transcription dataset from the Benajah Jay Antrim journals, Manuscript Division
By the People transcription campaign title : Journey across Mexico, Benajah Jay Antrim journals and sketchbooks
This dataset is an export of transcriptions created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign Journey Across Mexico: Benajah Jay Antrim Journals and Sketchbooks for 517 images from the Benajah Jay Antrim Journals digital collection. It contains volunteer-created and -reviewed text, volunteer-created tags, digital collections metadata, and metadata representing the arrangement of the items in the…
Contributor:
Antrim, Benajah Jay - By the People (Program)
Date:2025
Book/Printed Material
Gateway National Recreation Area Sandy Hook Unit, Fort Hancock, Mess Hall Building #58, Monmouth County, New Jersey : historic structure report
Mess Hall Building #58, Fort Hancock, New Jersey, Sandy Hook Unit, Gateway National Recreation Area : historic structure report | Mess Hall Building #58, Fort Hancock, Gateway National Recreation Area
"March 2011." "Prepared for National Park Service Denver Service Center." Appendix D, Historical photographs, and appendix G, Historical documents, appear on the accompanying CD-ROMs. Includes bibliographical references (p. 176-178). Also available in digital form on the Library of Congress Web site.
Contributor:
United States. National Park Service. Denver Service Center - John Milner Associates - United States. National Park Service. Northeast Region
Date:2011-01-01
Software, E-Resource
Transcription datasets from Rosa Parks Papers, Manuscript Division
By the People transcription campaign title : Rosa Parks : in her own words
This dataset is an export of transcriptions for 1,769 images from the Rosa Parks Papers created by volunteers participating in the Library of Congress crowdsourcing program By the People (https://crowd.loc.gov) campaign, Rosa Parks: In Her Own Words. It contains text created by volunteers through a transcription and review process, volunteer-created tags, digital collections metadata, and metadata representing the arrangement of the items in the…
Contributor:
By the People (Program) - Parks, Rosa
Contributor:
Midwest Archeological Center (U.S.) - De Vore, Steven Leroy - Nickel, Robert K.
Date:2003
Map
Cshapes 2.0
Title from title screen (viewed on November 20, 2023). Dataset includes data files (CSV, Shapefile, GeoJSON). CShapes 2.0 maps the borders and capitals of independent states and dependent territories from 1886 to 2019.
Contributor:
Center for Comparative and International Studies (Zurich, Switzerland)