What's the difference?: Textual analysis of variations in technical document structure
Vora, Aayushi; Rayani, Hiral; Carter, Jarai
Permalink
https://hdl.handle.net/2142/99899
Description
Title
What's the difference?: Textual analysis of variations in technical document structure
Author(s)
Vora, Aayushi
Rayani, Hiral
Carter, Jarai
Contributor(s)
Rosenthal, Jacob
Issue Date
2018-04-24
Keyword(s)
Text analysis
Text mining
Text parsing
Technical document
XML
Abstract
The aim of this project is to analyze text variations in technical documents provided by John Deere in order to better understand how the documents are structured. Examples of variations include synonymous titles and differences in content structure. The goals are to report how much the document texts deviate from the mean, describe the anatomy of the documents, and identify commonalities between documents. The results support the drafting of more structured standards, which would make the data inside the documents more manageable. Python is the language of choice for this project.