Chris Harrison, a Ph.D. student in Human-Computer Interaction at Carnegie Mellon, created an infographic of how “he” and “she” are used in Google’s digital books archive, which contains 200 years worth of published material. The graph shows the 120 most common words used after “he” and “she,” ordered in decreasing frequency.
When I asked Harrison what he found most interesting or surprising about the data, he responded (spoken like a true engineer): “Not any one thing was most interesting. As with many large data sets, there are many fascinating patterns. It is analogous to a single thread being rather unremarkable. From from many threads one can weave a tapestry.”
No comments:
Post a Comment