
dc.contributor.author: Jofre, Ana
dc.contributor.author: Berardi, Vincent
dc.contributor.author: Brennan, Kathleen P.J.
dc.contributor.author: Cornejo, Aisha
dc.contributor.author: Bennett, Carl
dc.contributor.author: Harlan, John
dc.date.accessioned: 2022-04-14T22:28:53Z
dc.date.available: 2022-04-14T22:28:53Z
dc.date.issued: 2020-03
dc.identifier.citation: Jofre, Ana, Vincent Berardi, Kathleen P.J. Brennan, Aisha Cornejo, Carl Bennett, and John Harlan. 2020. “Crowdsourcing Image Extraction and Annotation: Software Development and Case Study.” Digital Humanities Quarterly 14 (2). http://www.digitalhumanities.org/dhq/vol/14/2/000469/000469.html. [en_US]
dc.identifier.uri: http://www.digitalhumanities.org/dhq/vol/14/2/000469/000469.html
dc.identifier.uri: http://hdl.handle.net/20.500.12648/7156
dc.description.abstract: We describe the development of web-based software that facilitates large-scale, crowdsourced image extraction and annotation within image-heavy corpora that are of interest to the digital humanities. An application of this software is then detailed and evaluated through a case study where it was deployed within Amazon Mechanical Turk to extract and annotate faces from the archives of Time magazine. Annotation labels included categories such as age, gender, and race that were subsequently used to train machine learning models. The systemization of our crowdsourced data collection and worker quality verification procedures are detailed within this case study. We outline a data verification methodology that used validation images and required only two annotations per image to produce high-fidelity data that has comparable results to methods using five annotations per image. Finally, we provide instructions for customizing our software to meet the needs for other studies, with the goal of offering this resource to researchers undertaking the analysis of objects within other image-heavy archives. [en_US]
dc.language.iso: en_US [en_US]
dc.publisher: Digital Humanities Quarterly [en_US]
dc.subject: Crowdsourcing [en_US]
dc.subject: Image extraction and annotation [en_US]
dc.subject: Crowdsourcing software [en_US]
dc.subject: Time Magazine [en_US]
dc.subject: Data verification methodology [en_US]
dc.subject: Amazon Mechanical Turk (AMT) [en_US]
dc.title: Crowdsourcing Image Extraction and Annotation: Software Development and Case Study [en_US]
dc.type: Article/Review [en_US]
dc.source.journaltitle: Digital Humanities Quarterly [en_US]
dc.description.version: VoR [en_US]
dc.description.institution: SUNY Polytechnic Institute [en_US]
dc.description.department: Communications and Humanities Department [en_US]
dc.description.degreelevel: N/A [en_US]

