This week on DITA we learnt about distant reading and text analysis, and used various online tools to analyse text.
Distant reading is a form of reading where, instead of focusing on an in-depth analysis of one text, many texts are analysed together as a dataset to understand them as a whole. Text analysis works as a form of distant reading by examining large amounts of text for how frequently words appear, what patterns occur within the text and how often words are used in a particular context. There are various tools that can be used for text analysis, and in our lab we tried out just a few to generate word clouds. I did this with an Altmetric report on how often articles about Gender were tweeted in Library and Information Science.
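To make the idea of counting word frequency more concrete, here is a minimal Python sketch of the kind of counting that sits behind a word cloud. It is only an illustration: the sample sentence and the function name `word_frequencies` are made up for this example, and it is not how any of the tools below are actually implemented.

```python
import re
from collections import Counter


def word_frequencies(text):
    """Count how often each word appears, ignoring case and punctuation."""
    # Lower-case the text and pull out runs of letters (and apostrophes) as words.
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(words)


# Made-up sample text, standing in for a real dataset of tweets.
sample = "Gender and libraries. Gender in information science. Libraries and gender."
freqs = word_frequencies(sample)

# The most frequent words are the ones a word cloud would draw largest.
print(freqs.most_common(3))
```

A word cloud generator would then scale each word's font size according to its count.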
The first one is Wordle, which is a simple word cloud generator. It gives people the option of changing visuals such as font and colour, as well as the number of words used in the cloud. At most, it is only capable of generating a visual of the words.
The next one was Many-Eyes, which offers people a few more ways to visualise data besides word clouds, including pie charts and graphs. However, as much as I wanted to make a word cloud with it again, it took a long time to get it to visualise one without crashing. In terms of abilities I find it pretty similar to Wordle, but with the added choice when it comes to forms of visualisation. It still searches through text by frequency of appearance or alphabetically.
The final one, and my personal favourite, is Voyant. Voyant not only generates a word cloud but also offers many tools, such as editing stop words so you can exclude words that you feel are irrelevant, and seeing the number of times each word appears in the text.
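The effect of editing stop words can be sketched in a few lines of Python. This is a rough illustration under my own assumptions: the `STOP_WORDS` set, the function name `count_without_stop_words` and the sample text are all hypothetical, and Voyant's real stop word lists are much longer.

```python
import re
from collections import Counter

# A tiny, made-up stop word list for illustration (not Voyant's actual list).
STOP_WORDS = {"the", "a", "an", "and", "of", "in", "on", "to", "is"}


def count_without_stop_words(text, stop_words=STOP_WORDS):
    """Count words in the text, skipping any that appear in stop_words."""
    words = re.findall(r"[a-z']+", text.lower())
    return Counter(word for word in words if word not in stop_words)


# Made-up sample text standing in for the tweet data.
tweets = "The science of gender in the library and the science of information"
counts = count_without_stop_words(tweets)

# Common filler words are gone, so the meaningful terms stand out.
print(counts.most_common())
```

Adding a word to the set removes it from the counts, which is roughly what happens when you edit the stop word list in the tool and the cloud redraws.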
Not only that, the user is able to pick out and observe specific words. For example, if I wanted to know how often science is a subject in the tweets, Voyant can highlight where the word occurs in the text as well as the context around it. It can also compare the word with other words on a chart, and I compared it with the word Internet as a way to see how often the two appear together and where. Overall it is an effective tool for more detailed text analysis compared to the other two.
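Looking at a word together with its surrounding context is often called a keyword-in-context view. A minimal Python sketch of the idea follows. The function name `keyword_in_context`, the `window` parameter and the sample sentence are assumptions made for this example, not Voyant's own code or data.

```python
import re


def keyword_in_context(text, keyword, window=3):
    """Return (left context, match, right context) for each occurrence of keyword."""
    words = re.findall(r"[A-Za-z']+", text)
    hits = []
    for i, word in enumerate(words):
        if word.lower() == keyword.lower():
            # Take up to `window` words on either side of the match.
            left = " ".join(words[max(0, i - window):i])
            right = " ".join(words[i + 1:i + 1 + window])
            hits.append((left, word, right))
    return hits


# Made-up sample text standing in for the tweet data.
sample = (
    "Library science meets the Internet. "
    "Gender studies in information science are shared on the Internet."
)
hits = keyword_in_context(sample, "science")
for left, word, right in hits:
    print(f"{left} [{word}] {right}")
```

Running the same function for a second word, such as Internet, and comparing where the two sets of hits fall gives a simple version of seeing how often and where the words appear together.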