Analysis of Marked Up Documents | Florida State University Libraries

Analysis of Marked Up Documents

Please attend one of the “Text Analysis with R” sessions before attending this workshop. If you cannot attend the prerequisite session, contact Sarah Stanley for the slides and some test exercises to try beforehand.

Often, when we work with texts, they contain large amounts of extraneous material that we don’t want to analyze. We may want to suppress speaker labels in drama, advertisements in newspapers, or boilerplate language on a webpage. In this workshop, we will discuss how to extract specific textual features from XML and HTML documents using R. In our exercises, we will explore how the results differ between analyzing text that hasn’t been marked up and text that has.
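As a rough illustration of the idea, the sketch below uses the rvest package to pull only the spoken lines out of a small piece of marked-up drama, leaving the speaker labels behind. The HTML fragment, its class names (speech, speaker, line), and the variable names are invented for this example and are not the workshop's own materials. In older versions of rvest, html_nodes() plays the role of html_elements().

```r
library(rvest)

# Hypothetical sample: a tiny marked-up play with speaker labels.
play_html <- paste0(
  '<html><body>',
  '<div class="speech"><span class="speaker">HAMLET</span>',
  '<p class="line">To be, or not to be.</p></div>',
  '<div class="speech"><span class="speaker">OPHELIA</span>',
  '<p class="line">Good my lord.</p></div>',
  '</body></html>'
)

# Parse the string as an HTML document.
doc <- read_html(play_html)

# Text of each whole speech: the speaker label comes along with the line.
all_text <- html_text(html_elements(doc, "div.speech"))

# Text of the spoken lines only: the CSS selector skips the speaker labels.
spoken <- html_text(html_elements(doc, "p.line"))

print(all_text)
print(spoken)
```

Because the selector targets only the elements marked as lines, any word counts or frequency tables built from spoken would leave out the speaker names that appear in all_text.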

Please bring a laptop to this session with R and RStudio installed, following the instructions in this LibGuide. Before attending, please also install the “rvest” and “XML” packages in R by going to R > “Install packages” in RStudio. If you have problems with installation, or if you do not have access to a laptop, please contact Sarah Stanley prior to the session.
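If you prefer the R console to the menu, a quick check along the following lines should show whether the two packages are already available. This is only a sketch: the variable names are illustrative, and the install.packages() call is left commented out so that nothing is downloaded until you choose to run it.

```r
# Packages named in this announcement.
needed <- c("rvest", "XML")

# TRUE for each package that can be loaded on this machine.
installed <- vapply(needed, requireNamespace, logical(1), quietly = TRUE)
not_installed <- needed[!installed]

if (length(not_installed) > 0) {
  message("Still to install: ", paste(not_installed, collapse = ", "))
  # install.packages(not_installed)  # uncomment to install from CRAN
} else {
  message("Both packages are installed.")
}
```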

Event Date: Monday, February 11, 2019, 14:00 to 15:00
Event Location: Strozier Library, R&D Commons
