SlideShare ist ein Scribd-Unternehmen logo
1 von 74
Teaching with Google Books:
    research, copyright, and data
    mining

Nathan Rinne
Concordia University
Mar. 14, 2012
Library Technology Conference
Macalester College, St. Paul, MN.

A ll im a g e s a r e f a ir u s e o r
fro m th e
Short Description

Do you know about Google Books? Join an
exciting tour that will not only introduce the
Google Books Project and its history, but will
share ideas about using it as a springboard
to delve into issues like: a) data-mining; b)
copyright law; and c) research, both
personal and scholarly.

This presentation is based on a paper archived here:
http://hdl.handle.net/10760/16727
Outline

-Intro
Brief Google Book History and Tour

Understanding Copyright Law through
Google Books
Google Books and Research: the perks
and pitfalls
Google Books and the Digital Humanities

-Conclusion
Intro
   Themes of education, freedom and
    ethics interwoven in…
   Benjamin Franklin, on the effects of
    the growth of lending libraries:
    “These Libraries,” he wrote, “have improv’d
    the general Conversation of the Americans,
    made the common Tradesmen & Farmers as
    intelligent as most Gentlemen from other
    countries, and perhaps have contributed to
    some degree to the Stand so generally made http://en.wikipedia.org/wiki/File:B
                                                      enjamin_Franklin_by_Joseph-
    throughout the Colonies in Defence of their       Siffred_Duplessis.jpg
    Privileges.”
    Singer, Natasha. “Playing Catch-Up in a Digital Library Race.”
    New York Times, Jan. 8, 2011.
    http://www.nytimes.com/2011/01/09/business/09stream.html
Intro
“Knowledge is the                               My definition of knowledge:
common property of                              knowing how things
mankind.”                                       regularly transpire in the
                                                cosmos – and how these
                                                things can be understood
                                                (and perhaps harnessed) to
                                                help us move ever more
                                                successfully within it…




http://en.wikipedia.org/wiki/File:Thomas_Jeff
erson_by_Rembrandt_Peale,_1800.jpg
Intro
“Liberal arts” = arts “suitable
for a free man”
“the  areas of learning that
cultivate general intellectual ability
rather than technical or
professional skills. The term liberal
arts is often used as a synonym for
humanities, although the liberal
arts also include the sciences.”
   The New Dictionary of Cultural Literacy, Houghton Mifflin.   http://commons.wikimedia.org/wiki/File:Valenti
   Boston: Houghton Mifflin, 2002. s.v. "liberal arts,"
   http://www.credoreference.com/entry/hmndcl/liberal_arts
   (accessed March 02, 2012).
Intro
                                Google: “organizing the
                                world’s information and
                                making it universally
                                accessible and useful”
                                = instant gratification of our
                                information wants and
                                needs. It helps us to do
                                what we want… what we
www.flickr.com/photos           think is right… to freely
/72213316@N00/                  pursue the goals we think
3150692615/                     we should pursue.

So who can open the floodgates of
knowledge and education? Liberate?
Brief Google Book history and tour

“[book] information wants
to be free”




Depending on a book’s
copyright status, the full
text would be made
available freely online.
Brief Google Book history and tour




       “Part of core mission” What is the world of
       information without books?
                   Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of
                   Higher Education, August 31, 2009.
    http://chronicle.com/article/Googles-Book-Search-A/48245/
Brief Google Book history and tour

    Now they’ve got over 50
     other such libraries to help
     them
    2020 goal of digitizing 130
     million books (the amount
     they estimate exist)
Beck, Richard. “A bookshelf the Size of the World.” The
     Boston Globe (Boston) , July 24, 2011.
     http://articles.boston.com/2011-07-24/bostonglobe/29810463_1_google-books-robert-darnton-digitizatio

Sergey Brin (left) pic:
      http://en.wikipedia.org/wiki/File:Sergey_Brin_cropped.jpg
Larry Paige (right) pic: http://
      en.wikipedia.org/wiki/File:Larry_Page_laughs.jpg
Brief Google Book history and tour
     Michigan’s Paul Courant: no way that
      libraries could have done this alone. themselves….
     Mary Sue Coleman: project was a
      “legal, ethical and noble endeavor
      that will transform our society.”
     John P. Wilkin: “Things that can’t be found are not used.
      The things that are findable are used.”
Suber, Peter, “Michigan President Defends Google Library to AAP,” Open Access News: News from the Open Access
      Movement (blog), February 7, 2006 (8:46 a.m.),
      http://www.earlham.edu/~peters/fos/2006/02/michigan-president-defends-google.html, found originally here:
      Crawford, Walt. “Discovering Books: The OCA/GBS Saga Continues.” Cites & Insights: Crawford at Large 6, no. 6
      (Spring 2006). http://citesandinsights.info/v6i6a.htm ) ; Kellog, Sarah, “Going Public: A March Toward a National
      Digital Library”. DC Bar, November 2011. http://
      www.dcbar.org/for_lawyers/resources/publications/washington_lawyer/november_2011/digital_library.cfm
Brief Google Book history and tour

   Tour time: http://books.google.com/
   Note:
       “read”, “preview”, “snippet” and “no preview” books
       “Free Google eBooks” link
           Most “read” books pre-1923 (in public domain)
       “preview” and “snippet” – because of agreement with
        publisher, or…
        “orphans”….
       “no preview” –
        strict publishers
Brief Google Book history and tour

   Controversy: in-copyright but out-of-print book
    “snippets”. Sued by authors and publishers
   Orphans: who do they belong to?
   “Fair Use” defense -> settlement / book business
   $ 125 million registry to pay authors. Opt-out.
   Google gets to:
       show longer previews of most all of the out-of-print books
       allow persons to buy the books (print on-demand/e-books)
       show ads on the book pages online
       charge subscription fees to libraries and universities in
        order to access the full-text of the orphans
        “Judge Chin’s Ruling By the Number,” Open Book Alliance (blog), March 24, 2011.

        http://www.openbookalliance.org/2011/03/judge-chins-ruling-by-the-numbers/
Understanding Copyright Law through
Google Books
   Positives of the [revised] settlement
          Did what librarians/gov’t c/would have never
           done
          Access to millions of out-of-print but in-
           copyright books
          New life to old books!
          Service provided free of charge on at least
           one terminal in all public [and academic]
           libraries
    Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.),
           http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/
   Kolowich, Steve. “Please Refine Your Search Terms.” Inside Higher Ed, March 23, 2011.
    http://www.insidehighered.com/news/2011/03/23/judge_rejects_google_books_settlement

Understanding Copyright Law through
    Google Books
    Positives of the [revised] settlement
           Would be adapted to the needs of the visually
            impaired
           Data would be available for “large scale,
            quantitative research”
           Cuts down on expensive interlibrary loans – and
            help eliminate loans that disappoint
           “Authors and publishers [would] be able to
            cash in on long-neglected works”
     Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.),
            http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/
     Kolowich, Steve. “Please Refine Your Search Terms.” Inside Higher Ed, March 23, 2011.
     http://www.insidehighered.com/news/2011/03/23/judge_rejects_google_books_settlement
Understanding Copyright Law through
    Google Books
    Negatives of the [revised] settlement
         Opt-out clause for rights holders of out-of-print
          but copyright-protected books
         Foreign authors and publishers (U.K., Can. and
          Aus) not happy (international copyright law)
         Google would have exclusive protection vs.
          legal action by any rights holders who might
          come forth (who is the owner here?)
     Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011,
          (11:00 a.m.),
          http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/
Understanding Copyright Law through
    Google Books
    Negatives of the [revised] settlement
         Is the author’s guild (8,000 people) truly
          representative of all authors (6,800 authors
          opted out)?
         Many academics want their books to be free on
          GBS, so their ideas can be spread (no “Creative
          Commons” option)
         User privacy concerns (more on this later)
     Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011,
          (11:00 a.m.),
          http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/
Understanding Copyright Law through
    Google Books

    Big debate!                                What happened?
    The Economist:                             -Case thrown out.
     “The case has                              -Universal library
     stirred up                                   good…but this “too far”
     passions, conflict
                                                -Would have in effect
     and conspiracy
                                                  rewritten copyright law
     theories worthy
     of a literary                              -Congress’ job!
     blockbuster.”
                                       http://www.flickr.com/photos/mgifford/6117421227
“Google’s big book case,” The
     Economist. September 3,
     2009. http://
     www.economist.com/node/14363287
Understanding Copyright Law through
    Google Books
    Aftermath…
          Settlement can be revised (“opt in” necessary?)
          Author’s guild renewing lawsuit vs. Google and
           Hathi Trust



          Google (now): Author’s guild not sufficiently
           representative…
France-Presse, Agence. “U.S. Universities Hit with Copyright Infringement Suit”, The Raw Story
     (blog), September 12, 2011, (8:44 p.m.)
     http://www.rawstory.com/rs/2011/09/12/u-s-universities-hit-with-copyright-infringement-suit/
Coyle, Karen. “Google Files Motion to Dismiss.” Coyle’s InFormation (blog), December 26, 2011
     (2:16 p.m.), http://kcoyle.blogspot.com/2011/12/google-files-motion-to-dismiss.html
Understanding Copyright Law through
       Google Books

    Perils: “…dangers of placing all
     our information eggs in a
     private basket”.
    Darnton: “Google’s primary
     responsibility is to make money
     for its shareholders. Libraries
     exist to get books to readers…”

                                                           http://www.flickr.com/photos/mrs_logic/4875924
Darnton, Robert. "The Library: Three Jeremiads.” New
     York Review of Books. 57, no. 20: pp. 22-27.
Desai, Santosh. "Column: Are Books on Google Good for
     Us?" Financial Express, (Feb 02, 2010) n/a.
     http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/872800719?accountid=26720
     .
Understanding Copyright Law through
       Google Books

    Lawrence Lessig: Can’t rely on special favors
     from private companies…
    “…It is the environment for culture that the
     settlement will cement. [it turns] books into
     documentary film [where each clip must be
     purchased and renewed again and again]
     ….the deal constructs a world in which
     control can be exercised at the level of a
     page, and maybe even a quote. It is a world
     in which every bit, every published word,
                                                                             http://
     could be licensed.”                                                            www.flickr.com/photos/kub
Lessig, Lawrence. “For the Love of Culture.” The New Republic , January
     26, 2010 (12:00 am) http://www.tnr.com/print/article/the-love-culture
Understanding Copyright Law through
     Google Books

                                 Lessig: Pre-settlement, Google
                                  would have been victorious in
                                  court…project “sufficiently
                                  transformative” to be fair use.
                             Darnton: they should have made a

Fair Use logo                     robust case for fair use and tried to
http://                           set a legal precedent.
       wikimania2012.wikimedia.org/wiki/Main_Page
                   Lessig, Lawrence. “For the Love of Culture.” The New Republic , January 26,
                        2010 (12:00 am) http://www.tnr.com/print/article/the-love-culture
                   Whitebloom, Kenny. “Press: ‘Nothing Like it Has Ever Existed,” DPLA (blog),
                        January 18, 2012,
                        http://dp.la/2012/01/18/press-nothing-like-it-has-ever-existed/
Understanding Copyright Law through
       Google Books

    Copyright law purpose: “to promote the progress
     of science and the useful arts” (Act of 1790)
    Balancing “intellectual property” and “public
     domain”…
    Mattson: “Creativity requires stability…you can’t
     express yourself— write the book or article or
     teach the class—if you constantly worry about the
     next source of income… It’s about having time to
     reflect and think”.                                                        http://
                                                                                       en.wikipedia.org/wiki/File:G
    this kind of ownership goes hand in hand with
     people having the right to be paid for their work,
     and if they are not, trust in society decays.
Mattson, Kevin. "Paying the Piper: Is Culture Ever Free?." Dissent (00123846)
     58, no. 2 (Spring 2011): 69-73. Academic Search Premier, EBSCOhost
     (accessed March 9, 2012).
Understanding Copyright Law through
        Google Books

                                              “Overprotecting intellectual property
                                               is as harmful as underprotecting it.
                                               Culture is impossible without a rich
                                               public domain….overprotection stifles
                                               the very creative forces it's supposed to
                                               nurture.” – Judge Alex Kozinski
                                              Lessig: the free access that this [pre-
                                               commodification] world created is
http://
                                               an essential part of how we passed
                                               our culture along.
       en.wikipedia.org/wiki/File:Alex_Kozinski_cropped.jpg

                                      Dissenting in the White v. Samsung Elec. Am., Inc., 989 F.2d 1512
Alex Kozinski, Chief Judge of the
                                      (9th Cir. 1993) ruling.
     United States Court of
     Appeals for the Ninth Circuit,   Lessig, Lawrence. “For the Love of Culture.” The New Republic ,
                                      January 26, 2010 (12:00 am)
                                      http://www.tnr.com/print/article/the-love-culture
Understanding Copyright Law through
      Google Books

                        “Justice is a denial of mercy, and mercy is a denial of
                        justice. Only a higher force can reconcile these opposites:
                        wisdom. The problem cannot be solved, but wisdom can
                        transcend it. Similarly, societies need stability and
                        change, tradition and innovation, public interest and
                        private interest, planning and laissez-faire, order and
                        freedom, growth and decay. Everywhere society’s health
                        depends on the simultaneous pursuit of mutually opposed
http://                 activities or aims. The adoption of a final solution means
       en.wikipedia.org/wiki/File:SchumacherSiB200.jpg
                        a kind of death sentence for man’s humanity and spells
                        either cruelty or dissolution, generally both… Divergent
                        problems offend the logical mind.”

                  Schumacher, E. F. A Guide for the Perplexed. New York: Harper & Row, 1977,
                  127.
Understanding Copyright Law through
       Google Books

    “Google’s record suggests that it
     will not abuse its double-barreled
     fiscal-legal power… But what will
     happen if its current leaders sell the
     company or retire?”
    journals once were produced
     “solely in the spirit of free inquiry”…
                                                                http
    Need for a “Digital Public Lib. of                                ://www.flickr.com/photos/berkman

     America” (DPLA)
Thompson, Chris. "The Case Against Google Books; How three
    East Bay librarians led the revolt against the company's
    plans to archive all earthly knowledge." East Bay Express
    (California). October 14: LexisNexis Academic. Web.
    (accessed March 7, 2012).
Understanding Copyright Law through
     Google Books




   Digital Library to serve all Americans and beyond
   Would include many orphans and offer compensation
   Funded by grants, foundations and government
   Similar projects are taking place worldwide
   Google involved in efforts elsewhere and open to this to…
   Change the “ecology”…public good (not private gain)
Google Books and Research: the
perks and pitfalls

                              !
Thomas   Jefferson retirement library virtually
reconstructed with help from GBS (evidence of a book
transaction found in old journal….)
How does Google do it? New “popularity” algorithm.

Optical character recognition (OCR) tech, and metadata
from various sources…
Marlowe, L. “Washington diary.” Irish Times, Feb 26, 2011. P 18. Retrieved from
http://ezproxy.csp.edu/login?url=http://
search.proquest.com/docview/853784373?accountid=26720
Madrigal, Alexis. “Inside the Google Books Algorithm.” The Atlantic (blog), November 1, 2010,
(3:00 p.m.),
http://www.theatlantic.com/technology/archive/2010/11/inside-the-google-books-algorithm/65422
Google Books and Research: the
perks and pitfalls

                !
A newspaper columnist:
“Need Dutch oven history (my column two weeks ago)? It's
there.
Need first-person accounts of the Second Seminole War from
books published in the 1850s? They're there, too.
….From slave narratives to old travel guides to specialized
encyclopedias, Google Books can be a fantastic tool for the
historian or genealogist who is short on time to run to the
library.”
“Google books is a great source.” The Ledger, Jan 30, 2011. pp. n/a. Retrieved from
http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/847983062?accountid=26720
Google Books and Research: the
perks and pitfalls

                !
An academic:
“….now when doing research it is quite easy to track down
footnotes, whereas in the past one had to copy the reference
down, trudge over to the library, fill out an ILL slip, hope our
librarians found a library willing to lend a 150 year old book,
and then wait for it to arrive. Instead of weeks of hoping to
get a glimpse of a page, now often you can find things
instantly, delivered right to your desktop. (No, I don’t get
paid by Google for my posts)….”
Kloha, Jeff, “Words, Words, Words”, Concordia Theology (blog), December 21, 2010, (6:00
a.m.), http://concordiatheology.org/2010/12/words-words-words/
Google Books and Research: the
perks and pitfalls

          !
check  to see if a specific book covers something you’re
interested in
find out which books cite the journal article you are
interested in
cut down on interlibrary loan usage

discover rare texts and those with small print runs

highly granular searching: easily find historical
concepts that are not easily located using simple library
subject headings.
Google Books and Research: the
perks and pitfalls

           !
confirm   a quotation or see how a famous quote has
been used
discover unknown authors and works….

and of course… access to stuff that previously only
libraries had… (picking out the “best of the best” –
decades of collection development work by top-ranked
libraries…)
Google Books and Research: the
perks and pitfalls

         !
Check to see if a specific book covers
something you’re interested in:
     Use “search within the book” to find words,
      phrases or subjects in the book to see if the book
      will be useful…
     Will this book assist me in my research or
      collection building?
     Family history?
Google Books and Research: the
perks and pitfalls

           !
Find out which books cite the journal article
you are interested in
       Find out if a particular article was cited and
        commented on
       Use author’s last name, title of article and
        periodical
Google Books and Research: the
perks and pitfalls

          !
Cut   down on interlibrary loan usage
      Easily and inexpensively fill a request that
       otherwise would have not been possible (get
       quick PDF in hand!)
      May need to use advanced search functions
Google Books and Research: the
perks and pitfalls

                !
Discover         rare texts and those with small print runs
     “If you want information on the history of an area where an ancestor
     lived, type something like “History of Pike County, Illinois.” When I
     entered that term, I was shocked to learn that an 1880 book with the
     same name has been digitized and is available for free download at
     Google Books… one can easily search the book for people, localities
     and other key words.”
     Later, she writes, “Perhaps someone in your family…helped found an
     early church. When I entered the term ‘Baptists in Missouri,’ I learned
     that an 1882 book, ‘A History of the Baptists in Missouri,’ has been
     digitized and is in public domain.”
Meyer, Frankie. “Use Google as a resource for hard-to-find books” The Joplin Globe, February
27, 2012,
http://www.joplinglobe.com/lifestyles/x2118802287/Frankie-Meyer-Use-Google-as-a-resource-for-hard-t
Google Books and Research: the
perks and pitfalls

                !
Highly        granular searching
         easily find historical concepts that are not easily located using
          simple library subject headings
         One shared how she searched for the term “pin money” (money
          women had for spending in the 18th century )
         “Pin money” was not a subject heading, nor did it have a “see also
          heading”
         GBS quickly located several thousand results from the earliest
          appearances of the terms upward on.
Jackson, Millie. "Using Metadata to Discover the Buried Treasure in Google Book Search".
Journal of Library Administration. 47, no. ½ (2008): 165-173.
Google Books and Research: the
perks and pitfalls

                 http://www.flickr.com/photos/zachklein/54389823
            /-


No  authority control
OCR without human help

Flawed dates

Classification errors

Mismatched titles and authors

Gov doc issues, multi-volume issues, scanning
errors
Google Books and Research: the
        perks and pitfalls


   No authority control
        In library catalogs an author
         search for “Currer Bell” will
         re-direct you to the
         authorized heading “Bronte,
         Charlotte” (where can get all
         her books library has)
        GBS does not appear to
         utilize features like cross
         references and see-also
         references.
        No subject browsing! ->
Google Books and Research: the
perks and pitfalls


OCR     without human help
      “Enter the names of famous writers or public
       figures and restrict your search to works
       published before the year of their birth” – 29 hits
       for “Barack Obama” (in 2009)
      A search Google recommends on its Ngram
       viewer here. Why does “Abraham Lincoln” spike
       in the early 1800s?
  Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of Higher
      Education, August 31, 2009.
      http://chronicle.com/article/Googles-Book-Search-A/48245/
Google Books and Research: the
perks and pitfalls


Flawed          dates
         Published in 1899?                                                            etc…
Classification              errors
         Utilizes LCSH and BISAC…
         Moby Dick = Computers
         Cat Lover's Book of Fascinating Facts = Technology &
          Engineering
         Unbearable Weight: Feminism, Western Culture, and the
          Body (misdated 1899) = Health & Fitness
Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of Higher
Education, August 31, 2009. http://chronicle.com/article/Googles-Book-Search-A/48245/
Google Books and Research: the
perks and pitfalls

                   /-


Mismatched             titles and authors
      Madame Bovary by Henry James
      Mosaic Navigator: the essential                                http://
                                                                                en.wikipedia.org/wiki/File:Sigm
       guide to the Internet interface
       by Sigmund Freud and Katherine
       Jones.

  Pope J.T., and Holley R.P.(citing Nunberg) "Google book search
  and metadata". Cataloging and Classification Quarterly. 49, no. 1
  (2008): 1-13.
Google Books and Research: the
perks and pitfalls

                        /-


Gov’tdoc issues, multi-volume issues,
scanning errors
          Many gov’t documents are not available
          Cannot identify volume # in multi-volume works
          “Artistic” scans and scanning errors - see the site,
           The Art of Google Books
Pope J.T., and Holley R.P. "Google book
search and metadata". Cataloging and
Classification Quarterly. 49, no. 1 (2008):
1-13.
Google Books and Research: the
perks and pitfalls




Found here: http://
theartofgooglebooks.tumblr.com/post/18006886134/new-texts-created-when-read-through-burnt
                                          http
                                               ://theartofgooglebooks.tumblr.com/post/1789171
Google Books and Research: the
perks and pitfalls

   Scholars want to be able to…
       quickly locate multi-volume sets
       be able to quickly distinguish between various
        editions
       be able to count on accurate classification and
        headings, etc…
   Alternative: Hathi Trust
       Consortium of over 60 libs ; using Google scans
       More library tools ; for permanent curation
       Seeking out owners of orphans…
Google Books and Research: the
    perks and pitfalls

     “Quick and dirty” = “one ring
      to rule them all”
     Constant vigilance in being aware               http://
                                                en.wikipedia.org/wiki/File:Unico_Anello.png
      of information options?
     Geoffrey Nunberg: with its effective monopoly on the
      world’s only digital archive, researchers will come to
      depend on it, and they will assume Google’s got the details
      right… “Of course people will use it instead of their local
      library. Who wouldn’t? I use it all the time”.
Thompson, Chris. "The Case Against Google Books; How three East Bay librarians led the revolt against the
     company's plans to archive all earthly knowledge." East Bay Express (California). October 14: LexisNexis
     Academic. Web. (accessed March 7, 2012).
Google Books and Research: the
perks and pitfalls

    Hathi Trust problems:
         Authors Guild “noted that author J. R. Salamanca’s 1958
          novel The Lost Country was on the list of orphan books to
          be released by the consortium in October.”
         “…in a series of brief Web searches and telephone calls,
          found Salamanca, a professor emeritus at the University of
          Maryland, within minutes of starting the process”
         “extensive searches to find the original authors or
          copyright holders for all the orphan books scheduled for
          release”? Or no?
Kellog, Sarah, “Going Public: A March Toward a National Digital Library”. DC Bar, November
      2011.
      http://www.dcbar.org/for_lawyers/resources/publications/washington_lawyer/november_2011/digit
Google Books and the Digital
Humanities

    “Republic of Letters” clip (Digital humanities)
         Lots of human attention needed: more than scanning,
          OCR, and fancy algorithms to “mine” data
    Jon Orwant (Google), after attending
     conferences on digital humanities data
     mining: “I realized…we were sitting on this
     huge trove of value”.
Haven, Cynthia, “Stanford Technology Helps Scholars Get ‘Big Picture’ of the Enlightenment.”
      Stanford University News, December 17, 2009. http://
      news.stanford.edu/news/2009/december14/republic-of-letters-121809.html
Swift, Mike. 2010. "Google Books may Advance Humanities Research, Scholars Say."
      McClatchy - Tribune News Service, Aug 05, n/a.
      http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/737529948?accountid=2672
      .
Google Books and the Digital
     Humanities

    Two men from Harvard improved the Google Book
     Search dataset to show how useful it could be…
    Lieberman Aiden:
     “The goal is to give an 8-
     year old the ability to browse
     cultural trends throughout
     history, as recorded in books”.

Cohen, Patricia. "Google Database Puts
                                           Jean-Baptiste Michel and Eric
Language in a Petri Dish." International
                                                Lieberman Aiden presenting –
Herald Tribune, Dec 18, 2010: 12.
                                           http
                                                ://www.flickr.com/photos/ritterbin/591332
                                                /
Google Books and the Digital
Humanities

    Applying “high-turbro analysis to questions in the
     humanities”
    Called it “culturonomics” after “genomics”
    Google unveiled software in Dec. 16, 2010 and a paper
     by Lieberman Aiden and Michel and ten others released
     the same day…
    The Ngram viewer: Allows you do see the frequency of
     words or phrases over time – and the periods of times
     are statistically evened out (more books now than then)
Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December
    16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/
Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online):
    436-440. http://www.nature.com/news/2011/110617/full/474436a.html ,
    doi:10.1038/474436a
Google Books and the Digital
Humanities


    Take this prepared tour
    Try some of our own here:
      the decline of "propaganda" goes hand in hand

        with the rise of "Orwellian” (do these together and
        separate, and make sure to capitalize, as this is
        case-sensitive) ,
      “depression” overtakes “melancholy”, etc.

    Hours of fun, reflection…
Wile, Rob, “Google Books Reveals How Words Have Changed in Popularity Over Time”,
      Business Insider (blog), January 25, 2012, (8:19 a.m.),
      http://www.businessinsider.com/charts-google-books-reveals-the-most-popular-words-in-history-2
Google Books and the Digital
Humanities

    Google is giving grant money to scholars (history, sociology,
     linguistics, etc.) who want to use this dataset.
    One project in literature explained:
     Stanford professor of English and comparative literature
     Franco Moretti’s “team takes the Hardys and the Austens, the
     Thackerays and the Trollopes, and tosses their masterpieces
     into a database that contains hundreds of lesser novels. Then
     they cast giant digital nets into that megapot of words,
     trawling around like intelligence agents hunting for
     patterns in the chatter of terrorists.
     “Learning the algorithms that stitch together those nets is
     not typically part of an undergraduate English education….”
Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010.
     http://chronicle.com/article/The-Humanities-Go-Google/65713/
Google Books and the Digital
Humanities

Stuff they do:
Trace the novel going from an aristocratic literary form
to a more popular one: First names like “Jim” do not
appear before the 1870s, whereas before there were many
“Mr. Knightleys” and such.
Calculate how quickly irregular English verbs were
regularized – “chid” and “chode” went to “chided” in only
200 years (the “fastest verb to regularize”)
Haven, Cynthia, “Non-consumptive Research? Text Mining? Welcome to the Hotspot of
Humanities Research at Stanford” Stanford University News, December 1, 2010.
http://news.stanford.edu/news/2010/december/jockers-digitize-texts-120110.html
Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online):
436-440. http://www.nature.com/news/2011/110617/full/474436a.html , doi:10.1038/474436a
Google Books and the Digital
Humanities
Stuff they do:
“Detect the suppression of the names of artists and
intellectual books published in Nazi Germany, the Stalinist
Soviet Union, and contemporary China”
Realize that writing in a specific literary genre is
“immediately restrictive of artistic freedom in ways
writers never would guess” – The “place-centered” genre of
Gothic novels “(think: castles, dark places) [show] a “marked
inclination” toward "locative prepositions"– "where," "at,"
"towards."
Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December
16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/
Haven, Cynthia, “Non-consumptive Research? Text Mining? Welcome to the Hotspot of
Humanities Research at Stanford” Stanford University News, December 1, 2010. http://
news.stanford.edu/news/2010/december/jockers-digitize-texts-120110.html
Google Books and the Digital
Humanities

    Moretti: "It's like the invention of the telescope… All
     of a sudden, an enormous amount of matter
     becomes visible.”
    Implications: “…Culturonomics is clearly a discipline
     with a future, albeit one that hard to fathom for the
     time being.”
Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010.
      http://chronicle.com/article/The-Humanities-Go-Google/65713/
“Culturonomics and the Google Book Project,” The Physics arXiv Blog (blog), February 27,
      2012. http://www.technologyreview.com/blog/arxiv/27608/?p1=blogs
Google Books and the Digital
Humanities

    Concerns: humanities caught in the digital net?
    mindful of Seneca’s admonition that “too many
     books spoil the prof”, some humanities scholars are
     “apprehensive about the prospect of turning
     literary scholarship into an engineering
     problem”. - Geoffrey Nunberg
    “We know nothing can replace the balance of art
     and science that is the qualitative cornerstone of
     research in the humanities.” – John Orwant, Google
Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December
      16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/
“Find Out What’s in a Word, or Five, with the Google Books Ngram Viewer,” Google Official
      Blog (blog), December 16, 2010, (1:08 p.m.)
      http://googleblog.blogspot.com/2010/12/find-out-whats-in-word-or-five-with.html
Google Books and the Digital
Humanities

Concerns:
With financial stress and waning student interest, will the “lure of
money and technology…. Increasingly push computation front and
center”?
“Will [it] come at the expense of traditional approaches” and “sweep
the deck of all money for humanities everywhere else"?
If things like the Ngram viewer are “the gateway drug that leads to
more-serious involvement in quantitative research” will humanities
scholars give appropriate attention to their traditional way of working?
Will scholars form “such a close relationship that the tools” that they
“only work with Google-supplied data sets”, getting locked-in?
Even if the “first generation” original thinkers like Moretti show promise,
what about “’dullard’ descendants [who] take up ‘distant reading’ for their
research?”
Google Books and the Digital
Humanities

Citations from previous page:

Parry,  Marc. “The Humanities Go Google.” Chronicle of Higher Education, May
28, 2010. http://chronicle.com/article/The-Humanities-Go-Google/65713/
Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published
online): 436-440. http://www.nature.com/news/2011/110617/full/474436a.html ,
doi:10.1038/474436a
Geoffrey Nunberg Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of
Higher Education, December 16, 2010.
http://chronicle.com/article/Counting-on-Google-Books/125735/
Parry, Marc. “Google Starts Grant Program for Studies of Its Digitized Books.”
Chronicle of Higher Education, March 31, 2010.
http://chronicle.com/article/Google-Starts-Grant-Program/64891/
Google Books and the Digital
   Humanities

Lieberman Aiden:
“You can read a small
number of books very
carefully. Or you can read
lots of books ‘very, very not-
carefully’“…
Hand, Eric. “Culturonomics: Word Play,” Nature
474, June 17, 2011 (published online): 436-440.
http://www.nature.com/news/2011/110617/full/474436a.html
                                                 http://
 , doi:10.1038/474436a                                  en.wikipedia.org/wiki/File:Erez_Lieberman_Aiden
Google Books and the Digital
Humanities

Marc Perry:
“Data-diggers    are gunning to debunk old claims based on
‘anecdotal’ evidence and answer once-impossible questions
about the evolution of ideas, language, and culture. Critics,
meanwhile, worry that these stat-happy quants take the
human out of the humanities. Novels aren't commodities like
bags of flour, they warn. Cranking words from deeply
specific texts like grist through a mill is a recipe for lousy
research, they say—and a potential disaster for the
profession.”
Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010.
http://chronicle.com/article/The-Humanities-Go-Google/65713/
Google Books and the Digital
Humanities

Privacy matters:
The same kind of data-mining that is
used in the Ngram viewer can also be
used to produce advertising portfolios on
those who read.
Google’s executive chairman Eric
Schmidt: “If you have something that
you don’t want anyone to know, maybe
you shouldn’t be doing it in the first
place.”
“Google Book Privacy Still a Concern Post GBS,” Open
Book Alliance (blog), October 27, 2011.                       http://
                                                                     en.wikipedia.org/wiki/File:Eric_S
http://www.openbookalliance.org/2011/04/google-book-privacy-still-a-concern-post-gbs/
Google Books and the Digital
Humanities
    Google recently united all of their privacy policies into one….
    “They know what you do online (Google Search), who you
     correspond with (Google Voice, Gmail, Google Plus), where you go
     (Google Maps), and what you do (Google Calendar). With the
     privacy policy change, Google will be using data-mining algorithms
     to combine these sources of personal information to create detailed
     profiles of their users.”




“Hide from Google”, Wired How-to Wiki (Wiki), Last modified: February 3, 2012 (10:30 p.m.)
     http://howto.wired.com/wiki/Hide_From_Google
Google Books and the Digital
    Humanities
    Battle between the science and the humanities (“Two
     Cultures” – C.P. Snow) taken to the next level…
    If “culturonomics” gains more and more of a foothold, on what basis will
     agreements and disagreements in the humanities increasingly be
     evaluated?
    Will they primarily be evaluated
     on the basis of who has the
     better algorithmic methods and
     scientific methodologies? Or will
     they primarily be evaluated on the basis of
     the human interpretation that is
     the result of many hours of study
     via real reading?                   Shakespeare, John Brockman, and C.P. Snow – http
                                                ://gloriamundi.blogsome.com/category/science-a
Google Books and the Digital
      Humanities

                        “Justice is a denial of mercy, and mercy is a denial of
                        justice. Only a higher force can reconcile these opposites:
                        wisdom. The problem cannot be solved, but wisdom can
                        transcend it. Similarly, societies need stability and
                        change, tradition and innovation, public interest and
                        private interest, planning and laissez-faire, order and
                        freedom, growth and decay. Everywhere society’s
                        health depends on the simultaneous pursuit of
http://                 mutually opposed activities or aims. The adoption of
       en.wikipedia.org/wiki/File:SchumacherSiB200.jpg
                        a final solution means a kind of death sentence for
                        man’s humanity and spells either cruelty or
                        dissolution, generally both… Divergent problems
                        offend the logical mind.”

                  Schumacher, E. F. A Guide for the Perplexed. New York: Harper & Row, 1977,
                  127.
Google Books and the Digital
     Humanities

    “Meaning has an extraordinary multiplicity
     that cannot be easily captured by the rigidly
     limited vocabularies of variables and standard
     methods”
     – Andrew Abbot.

    Remember: with Google Books you can indeed
     just read the books.
Andrew Abbott, “The Traditional Future: A Computational Theory of Library Research” [pre-print])
Google Books and the Digital
     Humanities

Better Tools…
Google Book search was built to sell ads against.

Ronald G. Musto:

     Google Books has represented to us that its massive digitization
     project…that would make the digital at least the equivalent…of print.
     It is, after all, the ‘public good,’ not the ‘public good enough,’ that lies
     behind all of Google Books' claims for fair-use rights to its digitization
     schemes.”
Musto, Ronald G. "Google Books Mutilates the Printed Past". Chronicle of Higher Education. 55, no. 39
(2009).
Google Books and the Digital
     Humanities

More Musto:
“…..Within the scholarly and nonprofit realm over the past decade,
there have been dozens of digitization projects: some small, some
massive, some open-access, some offered by subscription, some
successful, more not so. But several things have united them all: a
common purpose for the true good of the community, the highest
standards of quality in both technology and content, and a deep-
seated and long-abiding concern for the curation, and wide
dissemination, of our cultural heritage as a living process that goes
beyond commodification.”

Musto, Ronald G. "Google Books Mutilates the Printed Past". Chronicle of Higher Education. 55, no. 39
(2009).
Google Books and the Digital
      Humanities

For example:
Text Creation Partnership (TCP), University of Michigan

Corpus of Historical American English at BYU.
          Manually transcribe OCR scans
          TCP: “Structural tagging” allows computer “to see elements of
           the book such as paragraphs, typeface changes, and chapters”
          This metadata allows searches in introductions, summaries,
           quotations, etc.
          OCR cannot detect non-standard typefaces, some foreign
           languages, even italics.
See http://corpus.byu.edu/coha/compare-googleBooks.asp
Martin, Shawn. “To Google or Not to Google, That is the Question: Supplementing Google Book
Search to Make it More Useful for Scholarship.” Journal of Library Administration 47, no 1-2 (2008):
141-150.
Google Books and the Digital
     Humanities

Privacy:
might not the commitment that librarians have to user
privacy be a “selling point” we should tout – especially as
some people grow increasingly concerned about such
things?
Currently, Google’s new policy notes that it does not collect
user data from Google Books to combine with other
services, but it is difficult to see why this seemingly
arbitrary decision will stand.
Policy & Internet: 2, no. 4 (2010). http://www.psocommons.org/policyandinternet/vol2/iss4/art3/
DOI: 10.2202/1944-2866.1072
Law, Ifrah, “EPIC Unlikely to Prevail in Challenge to FTC Stance on Google Privacy,” JDSUPRA (blog),
February 24, 2012.
http://www.jdsupra.com/post/documentViewer.aspx?fid=29c4c5c1-4eec-4f14-8c9d-7b64a2dc3a87
Google Books and the Digital
     Humanities

    “An idea like Google Books represents both all that is
     wonderful and all that is terrifying about the digital
     revolution…. A knowledge society needs its information
     in a fluid, readily accessible and easily navigable form. It
     also needs diversity, freedom and the chaotic cadence of
     a million voices that sing their own determined tunes.
     The question before us is not an easy one. Either way,
     we will all win and we will all lose.”
Desai, Santosh. "Column: Are Books on Google Good for Us?" Financial Express, (Feb 02, 2010) n/a.
     http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/872800719?accountid=26720
     .


     
Google Books and the Digital
   Humanities
Closing:
Remember     what Google’s goals are.
As Siva Vaidhyanathan reminds us, we
are not actually consumers when it comes
to Google (those would be its advertisers),
but Google’s product.
Our interests and attention are what
Google utilizes and ultimately sells.
In addition to using Google for all that it
is worth, we may also want to redirect
our interests to some of the others
sources I’ve mentioned – and to see
their value as well.
Select Bibliography
(more citations found in endnotes of paper mentioned earlier)

Bivens-Tatum,    Wayne. “Libraries and the Commodification of Culture, Academic Librarian (blog), February 13, 2012,
http://blogs.princeton.edu/librarian/2012/02/libraries-and-the-commodification-of-culture/
Coyle, Karen. “Google Files Motion to Dismiss.” Coyle’s InFormation (blog), December 26, 2011 (2:16 p.m.),
http://kcoyle.blogspot.com/2011/12/google-files-motion-to-dismiss.html
Desai, Santosh. "Column: Are Books on Google Good for Us?" Financial Express, (Feb 02, 2010) n/a.
http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/872800719?accountid=26720.
Darnton, Robert. “A Library Without Walls,” NYR Blog (blog), October 4, 2010 (9:20 a.m.),
http://www.nybooks.com/blogs/nyrblog/2010/oct/04/library-without-walls/.
Darnton, Robert. “Can We Create a National Digital Library?” New York Review of Books, October 28, 2010,
http://www.nybooks.com/articles/archives/2010/oct/28/can-we-create-national-digital-library/
Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.),
http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/
Darnton, Robert. "The Library: Three Jeremiads.” New York Review of Books. 57, no. 20 (November 2010): pp. 22-27.

Efrati, Amir. “Judge Rejects Google Books Settlement.” Wall Street Journal, Mar. 23, 2011.
http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/858106644?accountid=26720
“Google Book Privacy Still a Concern Post GBS,” Open Book Alliance (blog), October 27, 2011.
http://www.openbookalliance.org/2011/04/google-book-privacy-still-a-concern-post-gbs/
 “Google’s Big Book Case.” Economist, September 3, 2009. http://www.economist.com/node/14363287

 Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online): 436-440.
http://www.nature.com/news/2011/110617/full/474436a.html , doi:10.1038/474436a
 Haven, Cynthia, “Non-consumptive Research? Text Mining? Welcome to the Hotspot of Humanities Research at Stanford”
Stanford University News, December 1, 2010. http://news.stanford.edu/news/2010/december/jockers-digitize-texts-120110.html
    Howard, Jennifeer. "With No Google Books Deal, Libraries Push New Plans for Digital Access." Chronicle Of Higher
    Education 57, no. 30 (April 2011): A12. Academic Search Premier, EBSCOhost (accessed March 7, 2012)
   Jackson, Millie. "Using Metadata to Discover the Buried Treasure in Google Book Search". Journal of Library Administration.
    47, no. ½ (2008): 165-173.
   Kloha, Jeff, “Words, Words, Words”, Concordia Theology (blog), December 21, 2010, (6:00 a.m.),
    http://concordiatheology.org/2010/12/words-words-words/
   Lessig, Lawrence. “For the Love of Culture.” The New Republic , January 26, 2010 (12:00 am)
    http://www.tnr.com/print/article/the-love-culture
   Martin, Shawn. “To Google or Not to Google, That is the Question: Supplementing Google Book Search to Make it More
    Useful for Scholarship.” Journal of Library Administration 47, no 1-2 (2008): 141-150
   Mattson, Kevin. "Paying the Piper: Is Culture Ever Free?." Dissent (00123846) 58, no. 2 (Spring 2011): 69-73. Academic
    Search Premier, EBSCOhost (accessed March 9, 2012).
   Meyer, Frankie. “Use Google as a resource for hard-to-find books” The Joplin Globe, February 27, 2012,
    http://www.joplinglobe.com/lifestyles/x2118802287/Frankie-Meyer-Use-Google-as-a-resource-for-hard-to-find-books
   Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December 16, 2010.
    http://chronicle.com/article/Counting-on-Google-Books/125735/
   Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of Higher Education, August 31, 2009.
    http://chronicle.com/article/Googles-Book-Search-A/48245/
   Parry, Marc. “Google Starts Grant Program for Studies of Its Digitized Books.” Chronicle of Higher Education, March 31,
    2010. http://chronicle.com/article/Google-Starts-Grant-Program/64891/
   Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010.
    http://chronicle.com/article/The-Humanities-Go-Google/65713/
   Pope J.T., and Holley R.P. "Google book search and metadata". Cataloging and Classification Quarterly. 49, no. 1 (2008):
    1-13.
   Schumacher, E. F. A Guide for the Perplexed. New York: Harper & Row, 1977,127.
   Singer, Natasha. “Playing Catch-Up in a Digital Library Race.” New York Times, Jan. 8, 2011
   Swift, Mike. 2010. "Google Books may Advance Humanities Research, Scholars Say." McClatchy - Tribune News Service,
    Aug 05, n/a. http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/737529948?accountid=26720
   Thompson, Chris. "The Case Against Google Books; How three East Bay librarians led the revolt against the company's plans
    to archive all earthly knowledge." East Bay Express (California). October 14: LexisNexis Academic. Web. (accessed March 7,
    2012).
   “Tome Raider.” Economist, September 3, 2009. http://www.economist.com/node/14376406
   Wile, Rob, “Google Books Reveals How Words Have Changed in Popularity Over Time”, Business Insider (blog), January 25,
    2012, (8:19 a.m.), http://www.businessinsider.com/charts-google-books-reveals-the-most-popular-words-in-history-2012-1

Weitere ähnliche Inhalte

Was ist angesagt?

VRA 2009 (text version)
VRA 2009 (text version)VRA 2009 (text version)
VRA 2009 (text version)Michael Edson
 
Crkn agm oct 2009 google books settlement
Crkn agm oct 2009 google books settlementCrkn agm oct 2009 google books settlement
Crkn agm oct 2009 google books settlementTony Horava
 
Kindle garten, jeff bezos
Kindle garten, jeff bezosKindle garten, jeff bezos
Kindle garten, jeff bezosmpt001
 
The Social Web for Skeptics (or, Using the Social Web for Social Change)
The Social Web for Skeptics (or, Using the Social Web for Social Change)The Social Web for Skeptics (or, Using the Social Web for Social Change)
The Social Web for Skeptics (or, Using the Social Web for Social Change)Lauren Bacon
 
20100802 mdia5003 online_mobilemedia_presentation
20100802 mdia5003 online_mobilemedia_presentation20100802 mdia5003 online_mobilemedia_presentation
20100802 mdia5003 online_mobilemedia_presentationStephanieAnneBrown
 
Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014
Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014
Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014Fresh Egg UK
 
Building Sites Around Social Objects - Web 2.0 Expo SF 2009
Building Sites Around Social Objects - Web 2.0 Expo SF 2009Building Sites Around Social Objects - Web 2.0 Expo SF 2009
Building Sites Around Social Objects - Web 2.0 Expo SF 2009Jyri Engeström
 
NewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 Slideshow
NewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 SlideshowNewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 Slideshow
NewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 SlideshowDr. William J. Ward
 
20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]Frederick Zarndt
 

Was ist angesagt? (11)

VRA 2009 (text version)
VRA 2009 (text version)VRA 2009 (text version)
VRA 2009 (text version)
 
Crkn agm oct 2009 google books settlement
Crkn agm oct 2009 google books settlementCrkn agm oct 2009 google books settlement
Crkn agm oct 2009 google books settlement
 
Web2 0storytelling 2009
Web2 0storytelling 2009Web2 0storytelling 2009
Web2 0storytelling 2009
 
Kindle garten, jeff bezos
Kindle garten, jeff bezosKindle garten, jeff bezos
Kindle garten, jeff bezos
 
The Social Web for Skeptics (or, Using the Social Web for Social Change)
The Social Web for Skeptics (or, Using the Social Web for Social Change)The Social Web for Skeptics (or, Using the Social Web for Social Change)
The Social Web for Skeptics (or, Using the Social Web for Social Change)
 
20100802 mdia5003 online_mobilemedia_presentation
20100802 mdia5003 online_mobilemedia_presentation20100802 mdia5003 online_mobilemedia_presentation
20100802 mdia5003 online_mobilemedia_presentation
 
Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014
Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014
Google as Predator: The Evolution of Search by David Sewell - BrightonSEO 2014
 
Leveraging Emerging Technologies for Learning (Science)
Leveraging Emerging Technologies for Learning (Science)Leveraging Emerging Technologies for Learning (Science)
Leveraging Emerging Technologies for Learning (Science)
 
Building Sites Around Social Objects - Web 2.0 Expo SF 2009
Building Sites Around Social Objects - Web 2.0 Expo SF 2009Building Sites Around Social Objects - Web 2.0 Expo SF 2009
Building Sites Around Social Objects - Web 2.0 Expo SF 2009
 
NewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 Slideshow
NewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 SlideshowNewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 Slideshow
NewhouseSU COM 107 Communications and Society #NH1074Ward - Ch. 2 Slideshow
 
20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]20130123 Crowdsourcing [hamilton library u of hi]
20130123 Crowdsourcing [hamilton library u of hi]
 

Andere mochten auch

45820888 tafsir-tarbawi
45820888 tafsir-tarbawi45820888 tafsir-tarbawi
45820888 tafsir-tarbawipopon matuqoh
 
Research Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories MetadataResearch Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories MetadataRicard de la Vega
 
Data mining paper survey for Health Care Support System
Data mining paper survey for Health Care Support SystemData mining paper survey for Health Care Support System
Data mining paper survey for Health Care Support System鴻鈞 王
 
Preservaçao digital de tese e dissertaçoes
Preservaçao digital de tese e dissertaçoesPreservaçao digital de tese e dissertaçoes
Preservaçao digital de tese e dissertaçoesRicard de la Vega
 
Research in data mining
Research in data miningResearch in data mining
Research in data miningHouw Liong The
 
Social Targeting: Understanding Social Media Data Mining & Analysis
Social Targeting: Understanding Social Media Data Mining & AnalysisSocial Targeting: Understanding Social Media Data Mining & Analysis
Social Targeting: Understanding Social Media Data Mining & AnalysisInfini Graph
 
Google File System
Google File SystemGoogle File System
Google File Systemnadikari123
 
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...shibbirtanvin
 
Mining the Social Web to Analyze the Impact of Social Media on Socialization,...
Mining the Social Web to Analyze the Impact of Social Media on Socialization,...Mining the Social Web to Analyze the Impact of Social Media on Socialization,...
Mining the Social Web to Analyze the Impact of Social Media on Socialization,...shibbirtanvin
 
Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareNUS-ISS
 
Ayat-Ayat Al-Qur’an tentang Etos Kerja
Ayat-Ayat Al-Qur’an tentang Etos KerjaAyat-Ayat Al-Qur’an tentang Etos Kerja
Ayat-Ayat Al-Qur’an tentang Etos KerjaNajib Faiq
 
Emotion detection from text using data mining and text mining
Emotion detection from text using data mining and text miningEmotion detection from text using data mining and text mining
Emotion detection from text using data mining and text miningSakthi Dasans
 
Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...
Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...
Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...Sunil Nair
 
Recommender Systems - A Review and Recent Research Trends
Recommender Systems  -  A Review and Recent Research TrendsRecommender Systems  -  A Review and Recent Research Trends
Recommender Systems - A Review and Recent Research TrendsSujoy Bag
 
Hadoop HDFS Architeture and Design
Hadoop HDFS Architeture and DesignHadoop HDFS Architeture and Design
Hadoop HDFS Architeture and Designsudhakara st
 
Amazon Item-to-Item Recommendations
Amazon Item-to-Item RecommendationsAmazon Item-to-Item Recommendations
Amazon Item-to-Item RecommendationsRoger Chen
 

Andere mochten auch (17)

45820888 tafsir-tarbawi
45820888 tafsir-tarbawi45820888 tafsir-tarbawi
45820888 tafsir-tarbawi
 
Research Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories MetadataResearch Papers Recommender based on Digital Repositories Metadata
Research Papers Recommender based on Digital Repositories Metadata
 
Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...
Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...
Panel at AMIA 2013 Conference on big data - The Exposome and the quantified s...
 
Data mining paper survey for Health Care Support System
Data mining paper survey for Health Care Support SystemData mining paper survey for Health Care Support System
Data mining paper survey for Health Care Support System
 
Preservaçao digital de tese e dissertaçoes
Preservaçao digital de tese e dissertaçoesPreservaçao digital de tese e dissertaçoes
Preservaçao digital de tese e dissertaçoes
 
Research in data mining
Research in data miningResearch in data mining
Research in data mining
 
Social Targeting: Understanding Social Media Data Mining & Analysis
Social Targeting: Understanding Social Media Data Mining & AnalysisSocial Targeting: Understanding Social Media Data Mining & Analysis
Social Targeting: Understanding Social Media Data Mining & Analysis
 
Google File System
Google File SystemGoogle File System
Google File System
 
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
Preprocessing of Academic Data for Mining Association Rule, Presentation @WAD...
 
Mining the Social Web to Analyze the Impact of Social Media on Socialization,...
Mining the Social Web to Analyze the Impact of Social Media on Socialization,...Mining the Social Web to Analyze the Impact of Social Media on Socialization,...
Mining the Social Web to Analyze the Impact of Social Media on Socialization,...
 
Medical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in HealthcareMedical Informatics: Computational Analytics in Healthcare
Medical Informatics: Computational Analytics in Healthcare
 
Ayat-Ayat Al-Qur’an tentang Etos Kerja
Ayat-Ayat Al-Qur’an tentang Etos KerjaAyat-Ayat Al-Qur’an tentang Etos Kerja
Ayat-Ayat Al-Qur’an tentang Etos Kerja
 
Emotion detection from text using data mining and text mining
Emotion detection from text using data mining and text miningEmotion detection from text using data mining and text mining
Emotion detection from text using data mining and text mining
 
Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...
Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...
Clinical Decision Support Systems - Sunil Nair Health Informatics Dalhousie U...
 
Recommender Systems - A Review and Recent Research Trends
Recommender Systems  -  A Review and Recent Research TrendsRecommender Systems  -  A Review and Recent Research Trends
Recommender Systems - A Review and Recent Research Trends
 
Hadoop HDFS Architeture and Design
Hadoop HDFS Architeture and DesignHadoop HDFS Architeture and Design
Hadoop HDFS Architeture and Design
 
Amazon Item-to-Item Recommendations
Amazon Item-to-Item RecommendationsAmazon Item-to-Item Recommendations
Amazon Item-to-Item Recommendations
 

Ähnlich wie Teaching with Google Books: research, copyright, and data mining

Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...Ken Chad Consulting Ltd
 
What is fair use of 7TB?
What is fair use of 7TB?What is fair use of 7TB?
What is fair use of 7TB?paulcfyfe
 
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final RevisedTonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final RevisedYasar Tonta
 
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...Frederick Zarndt
 
HMID6303 Assignment 1 - Yeap
HMID6303 Assignment 1 - YeapHMID6303 Assignment 1 - Yeap
HMID6303 Assignment 1 - YeapYeap Aun
 
Openingandclosedsystems
OpeningandclosedsystemsOpeningandclosedsystems
OpeningandclosedsystemsFrancesca Lyn
 
Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008PrattSILS
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsJohn Breslin
 
googlization of information
googlization of informationgooglization of information
googlization of informationrajat00001in
 
Bookless Libraries
Bookless LibrariesBookless Libraries
Bookless LibrariesDheeraj Negi
 
Writ google glass arrangement
Writ google glass arrangementWrit google glass arrangement
Writ google glass arrangementwritRHET -
 
GLAM-WIKI presentation
GLAM-WIKI presentationGLAM-WIKI presentation
GLAM-WIKI presentationkentfitch
 
Being Engelbartian
Being EngelbartianBeing Engelbartian
Being EngelbartianJohn Bradley
 
Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008Karen S Calhoun
 
Open Publishing in the Humanities
Open Publishing in the HumanitiesOpen Publishing in the Humanities
Open Publishing in the HumanitiesFrancesca Di Donato
 

Ähnlich wie Teaching with Google Books: research, copyright, and data mining (20)

Google Books Lecture
Google Books LectureGoogle Books Lecture
Google Books Lecture
 
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
Discovering Library2.0 Libraryservices For The Google Generation Sconul June ...
 
What is fair use of 7TB?
What is fair use of 7TB?What is fair use of 7TB?
What is fair use of 7TB?
 
Keynote: Revolution for Sure: Envisioning a 21st Century Information Organiza...
Keynote: Revolution for Sure: Envisioning a 21st Century Information Organiza...Keynote: Revolution for Sure: Envisioning a 21st Century Information Organiza...
Keynote: Revolution for Sure: Envisioning a 21st Century Information Organiza...
 
NISO Oct 16 Virtual Conference: Revolution or Evolution: The Organizational I...
NISO Oct 16 Virtual Conference: Revolution or Evolution: The Organizational I...NISO Oct 16 Virtual Conference: Revolution or Evolution: The Organizational I...
NISO Oct 16 Virtual Conference: Revolution or Evolution: The Organizational I...
 
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final RevisedTonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
Tonta World Is Flat Yet Not Open Oslo Workshop 10 May 2006 Final Revised
 
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
2013 ifla satellite zarndt et al [crowdsourcing the world's cultural heritage...
 
HMID6303 Assignment 1 - Yeap
HMID6303 Assignment 1 - YeapHMID6303 Assignment 1 - Yeap
HMID6303 Assignment 1 - Yeap
 
But is an Emerging Technologies Informationist a Librarian?
But is an Emerging Technologies Informationist a Librarian? But is an Emerging Technologies Informationist a Librarian?
But is an Emerging Technologies Informationist a Librarian?
 
Openingandclosedsystems
OpeningandclosedsystemsOpeningandclosedsystems
Openingandclosedsystems
 
Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008Pratt Sils Knowledge Organization Fall 2008
Pratt Sils Knowledge Organization Fall 2008
 
The Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for SemanticsThe Future of Social Networks on the Internet: The Need for Semantics
The Future of Social Networks on the Internet: The Need for Semantics
 
googlization of information
googlization of informationgooglization of information
googlization of information
 
Bookless Libraries
Bookless LibrariesBookless Libraries
Bookless Libraries
 
Writ google glass arrangement
Writ google glass arrangementWrit google glass arrangement
Writ google glass arrangement
 
GLAM-WIKI presentation
GLAM-WIKI presentationGLAM-WIKI presentation
GLAM-WIKI presentation
 
Being Engelbartian
Being EngelbartianBeing Engelbartian
Being Engelbartian
 
Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008Calhoun Rbms Rev June 2008
Calhoun Rbms Rev June 2008
 
Open Publishing in the Humanities
Open Publishing in the HumanitiesOpen Publishing in the Humanities
Open Publishing in the Humanities
 
Beyond Google: Learning Literacies in a Web 2.0 World
Beyond Google:  Learning Literacies in a Web 2.0 WorldBeyond Google:  Learning Literacies in a Web 2.0 World
Beyond Google: Learning Literacies in a Web 2.0 World
 

Kürzlich hochgeladen

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Scott Keck-Warren
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyAlfredo García Lavilla
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Mattias Andersson
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...Fwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024The Digital Insurer
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesZilliz
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 

Kürzlich hochgeladen (20)

"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024Advanced Test Driven-Development @ php[tek] 2024
Advanced Test Driven-Development @ php[tek] 2024
 
Commit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easyCommit 2024 - Secret Management made easy
Commit 2024 - Secret Management made easy
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?Are Multi-Cloud and Serverless Good or Bad?
Are Multi-Cloud and Serverless Good or Bad?
 
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks..."LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024My INSURER PTE LTD - Insurtech Innovation Award 2024
My INSURER PTE LTD - Insurtech Innovation Award 2024
 
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptxE-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
E-Vehicle_Hacking_by_Parul Sharma_null_owasp.pptx
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Vector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector DatabasesVector Databases 101 - An introduction to the world of Vector Databases
Vector Databases 101 - An introduction to the world of Vector Databases
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 

Teaching with Google Books: research, copyright, and data mining

  • 1. Teaching with Google Books: research, copyright, and data mining Nathan Rinne Concordia University Mar. 14, 2012 Library Technology Conference Macalester College, St. Paul, MN. A ll im a g e s a r e f a ir u s e o r fro m th e
  • 2. Short Description Do you know about Google Books? Join an exciting tour that will not only introduce the Google Books Project and its history, but will share ideas about using it as a springboard to delve into issues like: a) data-mining; b) copyright law; and c) research, both personal and scholarly. This presentation is based on a paper archived here: http://hdl.handle.net/10760/16727
  • 3. Outline -Intro Brief Google Book History and Tour Understanding Copyright Law through Google Books Google Books and Research: the perks and pitfalls Google Books and the Digital Humanities -Conclusion
  • 4. Intro  Themes of education, freedom and ethics interwoven in…  Benjamin Franklin, on the effects of the growth of lending libraries: “These Libraries,” he wrote, “have improv’d the general Conversation of the Americans, made the common Tradesmen & Farmers as intelligent as most Gentlemen from other countries, and perhaps have contributed to some degree to the Stand so generally made http://en.wikipedia.org/wiki/File:B enjamin_Franklin_by_Joseph- throughout the Colonies in Defence of their Siffred_Duplessis.jpg Privileges.” Singer, Natasha. “Playing Catch-Up in a Digital Library Race.” New York Times, Jan. 8, 2011. http://www.nytimes.com/2011/01/09/business/09stream.html
  • 5. Intro “Knowledge is the My definition of knowledge: common property of knowing how things mankind.” regularly transpire in the cosmos – and how these things can be understood (and perhaps harnessed) to help us move ever more successfully within it… http://en.wikipedia.org/wiki/File:Thomas_Jeff erson_by_Rembrandt_Peale,_1800.jpg
  • 6. Intro “Liberal arts” = arts “suitable for a free man” “the areas of learning that cultivate general intellectual ability rather than technical or professional skills. The term liberal arts is often used as a synonym for humanities, although the liberal arts also include the sciences.” The New Dictionary of Cultural Literacy, Houghton Mifflin. http://commons.wikimedia.org/wiki/File:Valenti Boston: Houghton Mifflin, 2002. s.v. "liberal arts," http://www.credoreference.com/entry/hmndcl/liberal_arts (accessed March 02, 2012).
  • 7. Intro Google: “organizing the world’s information and making it universally accessible and useful” = instant gratification of our information wants and needs. It helps us to do what we want… what we www.flickr.com/photos think is right… to freely /72213316@N00/ pursue the goals we think 3150692615/ we should pursue. So who can open the floodgates of knowledge and education? Liberate?
  • 8. Brief Google Book history and tour “[book] information wants to be free” Depending on a book’s copyright status, the full text would be made available freely online.
  • 9. Brief Google Book history and tour “Part of core mission” What is the world of information without books? Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of Higher Education, August 31, 2009. http://chronicle.com/article/Googles-Book-Search-A/48245/
  • 10. Brief Google Book history and tour  Now they’ve got over 50 other such libraries to help them  2020 goal of digitizing 130 million books (the amount they estimate exist) Beck, Richard. “A bookshelf the Size of the World.” The Boston Globe (Boston) , July 24, 2011. http://articles.boston.com/2011-07-24/bostonglobe/29810463_1_google-books-robert-darnton-digitizatio Sergey Brin (left) pic: http://en.wikipedia.org/wiki/File:Sergey_Brin_cropped.jpg Larry Paige (right) pic: http:// en.wikipedia.org/wiki/File:Larry_Page_laughs.jpg
  • 11. Brief Google Book history and tour  Michigan’s Paul Courant: no way that libraries could have done this alone. themselves….  Mary Sue Coleman: project was a “legal, ethical and noble endeavor that will transform our society.”  John P. Wilkin: “Things that can’t be found are not used. The things that are findable are used.” Suber, Peter, “Michigan President Defends Google Library to AAP,” Open Access News: News from the Open Access Movement (blog), February 7, 2006 (8:46 a.m.), http://www.earlham.edu/~peters/fos/2006/02/michigan-president-defends-google.html, found originally here: Crawford, Walt. “Discovering Books: The OCA/GBS Saga Continues.” Cites & Insights: Crawford at Large 6, no. 6 (Spring 2006). http://citesandinsights.info/v6i6a.htm ) ; Kellog, Sarah, “Going Public: A March Toward a National Digital Library”. DC Bar, November 2011. http:// www.dcbar.org/for_lawyers/resources/publications/washington_lawyer/november_2011/digital_library.cfm
  • 12. Brief Google Book history and tour  Tour time: http://books.google.com/  Note:  “read”, “preview”, “snippet” and “no preview” books  “Free Google eBooks” link  Most “read” books pre-1923 (in public domain)  “preview” and “snippet” – because of agreement with publisher, or… “orphans”….  “no preview” – strict publishers
  • 13. Brief Google Book history and tour  Controversy: in-copyright but out-of-print book “snippets”. Sued by authors and publishers  Orphans: who do they belong to?  “Fair Use” defense -> settlement / book business  $ 125 million registry to pay authors. Opt-out.  Google gets to:  show longer previews of most all of the out-of-print books  allow persons to buy the books (print on-demand/e-books)  show ads on the book pages online  charge subscription fees to libraries and universities in order to access the full-text of the orphans “Judge Chin’s Ruling By the Number,” Open Book Alliance (blog), March 24, 2011. http://www.openbookalliance.org/2011/03/judge-chins-ruling-by-the-numbers/
  • 14. Understanding Copyright Law through Google Books  Positives of the [revised] settlement  Did what librarians/gov’t c/would have never done  Access to millions of out-of-print but in- copyright books  New life to old books!  Service provided free of charge on at least one terminal in all public [and academic] libraries Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.), http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/  Kolowich, Steve. “Please Refine Your Search Terms.” Inside Higher Ed, March 23, 2011. http://www.insidehighered.com/news/2011/03/23/judge_rejects_google_books_settlement 
  • 15. Understanding Copyright Law through Google Books  Positives of the [revised] settlement  Would be adapted to the needs of the visually impaired  Data would be available for “large scale, quantitative research”  Cuts down on expensive interlibrary loans – and help eliminate loans that disappoint  “Authors and publishers [would] be able to cash in on long-neglected works” Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.), http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/ Kolowich, Steve. “Please Refine Your Search Terms.” Inside Higher Ed, March 23, 2011. http://www.insidehighered.com/news/2011/03/23/judge_rejects_google_books_settlement
  • 16. Understanding Copyright Law through Google Books  Negatives of the [revised] settlement  Opt-out clause for rights holders of out-of-print but copyright-protected books  Foreign authors and publishers (U.K., Can. and Aus) not happy (international copyright law)  Google would have exclusive protection vs. legal action by any rights holders who might come forth (who is the owner here?) Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.), http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/
  • 17. Understanding Copyright Law through Google Books  Negatives of the [revised] settlement  Is the author’s guild (8,000 people) truly representative of all authors (6,800 authors opted out)?  Many academics want their books to be free on GBS, so their ideas can be spread (no “Creative Commons” option)  User privacy concerns (more on this later) Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.), http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/
  • 18. Understanding Copyright Law through Google Books  Big debate!  What happened?  The Economist:  -Case thrown out. “The case has  -Universal library stirred up good…but this “too far” passions, conflict  -Would have in effect and conspiracy rewritten copyright law theories worthy of a literary  -Congress’ job! blockbuster.” http://www.flickr.com/photos/mgifford/6117421227 “Google’s big book case,” The Economist. September 3, 2009. http:// www.economist.com/node/14363287
  • 19. Understanding Copyright Law through Google Books  Aftermath…  Settlement can be revised (“opt in” necessary?)  Author’s guild renewing lawsuit vs. Google and Hathi Trust  Google (now): Author’s guild not sufficiently representative… France-Presse, Agence. “U.S. Universities Hit with Copyright Infringement Suit”, The Raw Story (blog), September 12, 2011, (8:44 p.m.) http://www.rawstory.com/rs/2011/09/12/u-s-universities-hit-with-copyright-infringement-suit/ Coyle, Karen. “Google Files Motion to Dismiss.” Coyle’s InFormation (blog), December 26, 2011 (2:16 p.m.), http://kcoyle.blogspot.com/2011/12/google-files-motion-to-dismiss.html
  • 20. Understanding Copyright Law through Google Books  Perils: “…dangers of placing all our information eggs in a private basket”.  Darnton: “Google’s primary responsibility is to make money for its shareholders. Libraries exist to get books to readers…” http://www.flickr.com/photos/mrs_logic/4875924 Darnton, Robert. "The Library: Three Jeremiads.” New York Review of Books. 57, no. 20: pp. 22-27. Desai, Santosh. "Column: Are Books on Google Good for Us?" Financial Express, (Feb 02, 2010) n/a. http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/872800719?accountid=26720 .
  • 21. Understanding Copyright Law through Google Books  Lawrence Lessig: Can’t rely on special favors from private companies…  “…It is the environment for culture that the settlement will cement. [it turns] books into documentary film [where each clip must be purchased and renewed again and again] ….the deal constructs a world in which control can be exercised at the level of a page, and maybe even a quote. It is a world in which every bit, every published word, http:// could be licensed.” www.flickr.com/photos/kub Lessig, Lawrence. “For the Love of Culture.” The New Republic , January 26, 2010 (12:00 am) http://www.tnr.com/print/article/the-love-culture
  • 22. Understanding Copyright Law through Google Books  Lessig: Pre-settlement, Google would have been victorious in court…project “sufficiently transformative” to be fair use.  Darnton: they should have made a Fair Use logo robust case for fair use and tried to http:// set a legal precedent. wikimania2012.wikimedia.org/wiki/Main_Page Lessig, Lawrence. “For the Love of Culture.” The New Republic , January 26, 2010 (12:00 am) http://www.tnr.com/print/article/the-love-culture Whitebloom, Kenny. “Press: ‘Nothing Like it Has Ever Existed,” DPLA (blog), January 18, 2012, http://dp.la/2012/01/18/press-nothing-like-it-has-ever-existed/
  • 23. Understanding Copyright Law through Google Books  Copyright law purpose: “to promote the progress of science and the useful arts” (Act of 1790)  Balancing “intellectual property” and “public domain”…  Mattson: “Creativity requires stability…you can’t express yourself— write the book or article or teach the class—if you constantly worry about the next source of income… It’s about having time to reflect and think”. http:// en.wikipedia.org/wiki/File:G  this kind of ownership goes hand in hand with people having the right to be paid for their work, and if they are not, trust in society decays. Mattson, Kevin. "Paying the Piper: Is Culture Ever Free?." Dissent (00123846) 58, no. 2 (Spring 2011): 69-73. Academic Search Premier, EBSCOhost (accessed March 9, 2012).
  • 24. Understanding Copyright Law through Google Books  “Overprotecting intellectual property is as harmful as underprotecting it. Culture is impossible without a rich public domain….overprotection stifles the very creative forces it's supposed to nurture.” – Judge Alex Kozinski  Lessig: the free access that this [pre- commodification] world created is http:// an essential part of how we passed our culture along. en.wikipedia.org/wiki/File:Alex_Kozinski_cropped.jpg Dissenting in the White v. Samsung Elec. Am., Inc., 989 F.2d 1512 Alex Kozinski, Chief Judge of the (9th Cir. 1993) ruling. United States Court of Appeals for the Ninth Circuit, Lessig, Lawrence. “For the Love of Culture.” The New Republic , January 26, 2010 (12:00 am) http://www.tnr.com/print/article/the-love-culture
  • 25. Understanding Copyright Law through Google Books “Justice is a denial of mercy, and mercy is a denial of justice. Only a higher force can reconcile these opposites: wisdom. The problem cannot be solved, but wisdom can transcend it. Similarly, societies need stability and change, tradition and innovation, public interest and private interest, planning and laissez-faire, order and freedom, growth and decay. Everywhere society’s health depends on the simultaneous pursuit of mutually opposed http:// activities or aims. The adoption of a final solution means en.wikipedia.org/wiki/File:SchumacherSiB200.jpg a kind of death sentence for man’s humanity and spells either cruelty or dissolution, generally both… Divergent problems offend the logical mind.” Schumacher, E. F. A Guide for the Perplexed. New York: Harper & Row, 1977, 127.
  • 26. Understanding Copyright Law through Google Books  “Google’s record suggests that it will not abuse its double-barreled fiscal-legal power… But what will happen if its current leaders sell the company or retire?”  journals once were produced “solely in the spirit of free inquiry”… http  Need for a “Digital Public Lib. of ://www.flickr.com/photos/berkman America” (DPLA) Thompson, Chris. "The Case Against Google Books; How three East Bay librarians led the revolt against the company's plans to archive all earthly knowledge." East Bay Express (California). October 14: LexisNexis Academic. Web. (accessed March 7, 2012).
  • 27. Understanding Copyright Law through Google Books  Digital Library to serve all Americans and beyond  Would include many orphans and offer compensation  Funded by grants, foundations and government  Similar projects are taking place worldwide  Google involved in efforts elsewhere and open to this to…  Change the “ecology”…public good (not private gain)
  • 28. Google Books and Research: the perks and pitfalls ! Thomas Jefferson retirement library virtually reconstructed with help from GBS (evidence of a book transaction found in old journal….) How does Google do it? New “popularity” algorithm. Optical character recognition (OCR) tech, and metadata from various sources… Marlowe, L. “Washington diary.” Irish Times, Feb 26, 2011. P 18. Retrieved from http://ezproxy.csp.edu/login?url=http:// search.proquest.com/docview/853784373?accountid=26720 Madrigal, Alexis. “Inside the Google Books Algorithm.” The Atlantic (blog), November 1, 2010, (3:00 p.m.), http://www.theatlantic.com/technology/archive/2010/11/inside-the-google-books-algorithm/65422
  • 29. Google Books and Research: the perks and pitfalls ! A newspaper columnist: “Need Dutch oven history (my column two weeks ago)? It's there. Need first-person accounts of the Second Seminole War from books published in the 1850s? They're there, too. ….From slave narratives to old travel guides to specialized encyclopedias, Google Books can be a fantastic tool for the historian or genealogist who is short on time to run to the library.” “Google books is a great source.” The Ledger, Jan 30, 2011. pp. n/a. Retrieved from http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/847983062?accountid=26720
  • 30. Google Books and Research: the perks and pitfalls ! An academic: “….now when doing research it is quite easy to track down footnotes, whereas in the past one had to copy the reference down, trudge over to the library, fill out an ILL slip, hope our librarians found a library willing to lend a 150 year old book, and then wait for it to arrive. Instead of weeks of hoping to get a glimpse of a page, now often you can find things instantly, delivered right to your desktop. (No, I don’t get paid by Google for my posts)….” Kloha, Jeff, “Words, Words, Words”, Concordia Theology (blog), December 21, 2010, (6:00 a.m.), http://concordiatheology.org/2010/12/words-words-words/
  • 31. Google Books and Research: the perks and pitfalls ! check to see if a specific book covers something you’re interested in find out which books cite the journal article you are interested in cut down on interlibrary loan usage discover rare texts and those with small print runs highly granular searching: easily find historical concepts that are not easily located using simple library subject headings.
  • 32. Google Books and Research: the perks and pitfalls ! confirm a quotation or see how a famous quote has been used discover unknown authors and works…. and of course… access to stuff that previously only libraries had… (picking out the “best of the best” – decades of collection development work by top-ranked libraries…)
  • 33. Google Books and Research: the perks and pitfalls ! Check to see if a specific book covers something you’re interested in:  Use “search within the book” to find words, phrases or subjects in the book to see if the book will be useful…  Will this book assist me in my research or collection building?  Family history?
  • 34. Google Books and Research: the perks and pitfalls ! Find out which books cite the journal article you are interested in  Find out if a particular article was cited and commented on  Use author’s last name, title of article and periodical
  • 35. Google Books and Research: the perks and pitfalls ! Cut down on interlibrary loan usage  Easily and inexpensively fill a request that otherwise would have not been possible (get quick PDF in hand!)  May need to use advanced search functions
  • 36. Google Books and Research: the perks and pitfalls ! Discover rare texts and those with small print runs “If you want information on the history of an area where an ancestor lived, type something like “History of Pike County, Illinois.” When I entered that term, I was shocked to learn that an 1880 book with the same name has been digitized and is available for free download at Google Books… one can easily search the book for people, localities and other key words.” Later, she writes, “Perhaps someone in your family…helped found an early church. When I entered the term ‘Baptists in Missouri,’ I learned that an 1882 book, ‘A History of the Baptists in Missouri,’ has been digitized and is in public domain.” Meyer, Frankie. “Use Google as a resource for hard-to-find books” The Joplin Globe, February 27, 2012, http://www.joplinglobe.com/lifestyles/x2118802287/Frankie-Meyer-Use-Google-as-a-resource-for-hard-t
  • 37. Google Books and Research: the perks and pitfalls ! Highly granular searching  easily find historical concepts that are not easily located using simple library subject headings  One shared how she searched for the term “pin money” (money women had for spending in the 18th century )  “Pin money” was not a subject heading, nor did it have a “see also heading”  GBS quickly located several thousand results from the earliest appearances of the terms upward on. Jackson, Millie. "Using Metadata to Discover the Buried Treasure in Google Book Search". Journal of Library Administration. 47, no. ½ (2008): 165-173.
  • 38. Google Books and Research: the perks and pitfalls http://www.flickr.com/photos/zachklein/54389823 /- No authority control OCR without human help Flawed dates Classification errors Mismatched titles and authors Gov doc issues, multi-volume issues, scanning errors
  • 39. Google Books and Research: the perks and pitfalls  No authority control  In library catalogs an author search for “Currer Bell” will re-direct you to the authorized heading “Bronte, Charlotte” (where can get all her books library has)  GBS does not appear to utilize features like cross references and see-also references.  No subject browsing! ->
  • 40. Google Books and Research: the perks and pitfalls OCR without human help  “Enter the names of famous writers or public figures and restrict your search to works published before the year of their birth” – 29 hits for “Barack Obama” (in 2009)  A search Google recommends on its Ngram viewer here. Why does “Abraham Lincoln” spike in the early 1800s? Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of Higher Education, August 31, 2009. http://chronicle.com/article/Googles-Book-Search-A/48245/
  • 41. Google Books and Research: the perks and pitfalls Flawed dates  Published in 1899? etc… Classification errors  Utilizes LCSH and BISAC…  Moby Dick = Computers  Cat Lover's Book of Fascinating Facts = Technology & Engineering  Unbearable Weight: Feminism, Western Culture, and the Body (misdated 1899) = Health & Fitness Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of Higher Education, August 31, 2009. http://chronicle.com/article/Googles-Book-Search-A/48245/
  • 42. Google Books and Research: the perks and pitfalls /- Mismatched titles and authors  Madame Bovary by Henry James  Mosaic Navigator: the essential http:// en.wikipedia.org/wiki/File:Sigm guide to the Internet interface by Sigmund Freud and Katherine Jones. Pope J.T., and Holley R.P.(citing Nunberg) "Google book search and metadata". Cataloging and Classification Quarterly. 49, no. 1 (2008): 1-13.
  • 43. Google Books and Research: the perks and pitfalls /- Gov’tdoc issues, multi-volume issues, scanning errors  Many gov’t documents are not available  Cannot identify volume # in multi-volume works  “Artistic” scans and scanning errors - see the site, The Art of Google Books Pope J.T., and Holley R.P. "Google book search and metadata". Cataloging and Classification Quarterly. 49, no. 1 (2008): 1-13.
  • 44. Google Books and Research: the perks and pitfalls Found here: http:// theartofgooglebooks.tumblr.com/post/18006886134/new-texts-created-when-read-through-burnt http ://theartofgooglebooks.tumblr.com/post/1789171
  • 45. Google Books and Research: the perks and pitfalls  Scholars want to be able to…  quickly locate multi-volume sets  be able to quickly distinguish between various editions  be able to count on accurate classification and headings, etc…  Alternative: Hathi Trust  Consortium of over 60 libs ; using Google scans  More library tools ; for permanent curation  Seeking out owners of orphans…
  • 46. Google Books and Research: the perks and pitfalls  “Quick and dirty” = “one ring to rule them all”  Constant vigilance in being aware http:// en.wikipedia.org/wiki/File:Unico_Anello.png of information options?  Geoffrey Nunberg: with its effective monopoly on the world’s only digital archive, researchers will come to depend on it, and they will assume Google’s got the details right… “Of course people will use it instead of their local library. Who wouldn’t? I use it all the time”. Thompson, Chris. "The Case Against Google Books; How three East Bay librarians led the revolt against the company's plans to archive all earthly knowledge." East Bay Express (California). October 14: LexisNexis Academic. Web. (accessed March 7, 2012).
  • 47. Google Books and Research: the perks and pitfalls  Hathi Trust problems:  Authors Guild “noted that author J. R. Salamanca’s 1958 novel The Lost Country was on the list of orphan books to be released by the consortium in October.”  “…in a series of brief Web searches and telephone calls, found Salamanca, a professor emeritus at the University of Maryland, within minutes of starting the process”  “extensive searches to find the original authors or copyright holders for all the orphan books scheduled for release”? Or no? Kellog, Sarah, “Going Public: A March Toward a National Digital Library”. DC Bar, November 2011. http://www.dcbar.org/for_lawyers/resources/publications/washington_lawyer/november_2011/digit
  • 48. Google Books and the Digital Humanities  “Republic of Letters” clip (Digital humanities)  Lots of human attention needed: more than scanning, OCR, and fancy algorithms to “mine” data  Jon Orwant (Google), after attending conferences on digital humanities data mining: “I realized…we were sitting on this huge trove of value”. Haven, Cynthia, “Stanford Technology Helps Scholars Get ‘Big Picture’ of the Enlightenment.” Stanford University News, December 17, 2009. http:// news.stanford.edu/news/2009/december14/republic-of-letters-121809.html Swift, Mike. 2010. "Google Books may Advance Humanities Research, Scholars Say." McClatchy - Tribune News Service, Aug 05, n/a. http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/737529948?accountid=2672 .
  • 49. Google Books and the Digital Humanities  Two men from Harvard improved the Google Book Search dataset to show how useful it could be…  Lieberman Aiden: “The goal is to give an 8- year old the ability to browse cultural trends throughout history, as recorded in books”. Cohen, Patricia. "Google Database Puts Jean-Baptiste Michel and Eric Language in a Petri Dish." International Lieberman Aiden presenting – Herald Tribune, Dec 18, 2010: 12. http ://www.flickr.com/photos/ritterbin/591332 /
  • 50. Google Books and the Digital Humanities  Applying “high-turbro analysis to questions in the humanities”  Called it “culturonomics” after “genomics”  Google unveiled software in Dec. 16, 2010 and a paper by Lieberman Aiden and Michel and ten others released the same day…  The Ngram viewer: Allows you do see the frequency of words or phrases over time – and the periods of times are statistically evened out (more books now than then) Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December 16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/ Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online): 436-440. http://www.nature.com/news/2011/110617/full/474436a.html , doi:10.1038/474436a
  • 51. Google Books and the Digital Humanities  Take this prepared tour  Try some of our own here:  the decline of "propaganda" goes hand in hand with the rise of "Orwellian” (do these together and separate, and make sure to capitalize, as this is case-sensitive) ,  “depression” overtakes “melancholy”, etc.  Hours of fun, reflection… Wile, Rob, “Google Books Reveals How Words Have Changed in Popularity Over Time”, Business Insider (blog), January 25, 2012, (8:19 a.m.), http://www.businessinsider.com/charts-google-books-reveals-the-most-popular-words-in-history-2
  • 52. Google Books and the Digital Humanities  Google is giving grant money to scholars (history, sociology, linguistics, etc.) who want to use this dataset.  One project in literature explained: Stanford professor of English and comparative literature Franco Moretti’s “team takes the Hardys and the Austens, the Thackerays and the Trollopes, and tosses their masterpieces into a database that contains hundreds of lesser novels. Then they cast giant digital nets into that megapot of words, trawling around like intelligence agents hunting for patterns in the chatter of terrorists. “Learning the algorithms that stitch together those nets is not typically part of an undergraduate English education….” Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010. http://chronicle.com/article/The-Humanities-Go-Google/65713/
  • 53. Google Books and the Digital Humanities Stuff they do: Trace the novel going from an aristocratic literary form to a more popular one: First names like “Jim” do not appear before the 1870s, whereas before there were many “Mr. Knightleys” and such. Calculate how quickly irregular English verbs were regularized – “chid” and “chode” went to “chided” in only 200 years (the “fastest verb to regularize”) Haven, Cynthia, “Non-consumptive Research? Text Mining? Welcome to the Hotspot of Humanities Research at Stanford” Stanford University News, December 1, 2010. http://news.stanford.edu/news/2010/december/jockers-digitize-texts-120110.html Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online): 436-440. http://www.nature.com/news/2011/110617/full/474436a.html , doi:10.1038/474436a
  • 54. Google Books and the Digital Humanities Stuff they do: “Detect the suppression of the names of artists and intellectual books published in Nazi Germany, the Stalinist Soviet Union, and contemporary China” Realize that writing in a specific literary genre is “immediately restrictive of artistic freedom in ways writers never would guess” – The “place-centered” genre of Gothic novels “(think: castles, dark places) [show] a “marked inclination” toward "locative prepositions"– "where," "at," "towards." Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December 16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/ Haven, Cynthia, “Non-consumptive Research? Text Mining? Welcome to the Hotspot of Humanities Research at Stanford” Stanford University News, December 1, 2010. http:// news.stanford.edu/news/2010/december/jockers-digitize-texts-120110.html
  • 55. Google Books and the Digital Humanities  Moretti: "It's like the invention of the telescope… All of a sudden, an enormous amount of matter becomes visible.”  Implications: “…Culturonomics is clearly a discipline with a future, albeit one that hard to fathom for the time being.” Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010. http://chronicle.com/article/The-Humanities-Go-Google/65713/ “Culturonomics and the Google Book Project,” The Physics arXiv Blog (blog), February 27, 2012. http://www.technologyreview.com/blog/arxiv/27608/?p1=blogs
  • 56. Google Books and the Digital Humanities  Concerns: humanities caught in the digital net?  mindful of Seneca’s admonition that “too many books spoil the prof”, some humanities scholars are “apprehensive about the prospect of turning literary scholarship into an engineering problem”. - Geoffrey Nunberg  “We know nothing can replace the balance of art and science that is the qualitative cornerstone of research in the humanities.” – John Orwant, Google Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December 16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/ “Find Out What’s in a Word, or Five, with the Google Books Ngram Viewer,” Google Official Blog (blog), December 16, 2010, (1:08 p.m.) http://googleblog.blogspot.com/2010/12/find-out-whats-in-word-or-five-with.html
  • 57. Google Books and the Digital Humanities Concerns: With financial stress and waning student interest, will the “lure of money and technology…. Increasingly push computation front and center”? “Will [it] come at the expense of traditional approaches” and “sweep the deck of all money for humanities everywhere else"? If things like the Ngram viewer are “the gateway drug that leads to more-serious involvement in quantitative research” will humanities scholars give appropriate attention to their traditional way of working? Will scholars form “such a close relationship that the tools” that they “only work with Google-supplied data sets”, getting locked-in? Even if the “first generation” original thinkers like Moretti show promise, what about “’dullard’ descendants [who] take up ‘distant reading’ for their research?”
  • 58. Google Books and the Digital Humanities Citations from previous page: Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010. http://chronicle.com/article/The-Humanities-Go-Google/65713/ Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online): 436-440. http://www.nature.com/news/2011/110617/full/474436a.html , doi:10.1038/474436a Geoffrey Nunberg Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December 16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/ Parry, Marc. “Google Starts Grant Program for Studies of Its Digitized Books.” Chronicle of Higher Education, March 31, 2010. http://chronicle.com/article/Google-Starts-Grant-Program/64891/
  • 59. Google Books and the Digital Humanities Lieberman Aiden: “You can read a small number of books very carefully. Or you can read lots of books ‘very, very not- carefully’“… Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online): 436-440. http://www.nature.com/news/2011/110617/full/474436a.html http:// , doi:10.1038/474436a en.wikipedia.org/wiki/File:Erez_Lieberman_Aiden
  • 60. Google Books and the Digital Humanities Marc Perry: “Data-diggers are gunning to debunk old claims based on ‘anecdotal’ evidence and answer once-impossible questions about the evolution of ideas, language, and culture. Critics, meanwhile, worry that these stat-happy quants take the human out of the humanities. Novels aren't commodities like bags of flour, they warn. Cranking words from deeply specific texts like grist through a mill is a recipe for lousy research, they say—and a potential disaster for the profession.” Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010. http://chronicle.com/article/The-Humanities-Go-Google/65713/
  • 61. Google Books and the Digital Humanities Privacy matters: The same kind of data-mining that is used in the Ngram viewer can also be used to produce advertising portfolios on those who read. Google’s executive chairman Eric Schmidt: “If you have something that you don’t want anyone to know, maybe you shouldn’t be doing it in the first place.” “Google Book Privacy Still a Concern Post GBS,” Open Book Alliance (blog), October 27, 2011. http:// en.wikipedia.org/wiki/File:Eric_S http://www.openbookalliance.org/2011/04/google-book-privacy-still-a-concern-post-gbs/
  • 62. Google Books and the Digital Humanities  Google recently united all of their privacy policies into one….  “They know what you do online (Google Search), who you correspond with (Google Voice, Gmail, Google Plus), where you go (Google Maps), and what you do (Google Calendar). With the privacy policy change, Google will be using data-mining algorithms to combine these sources of personal information to create detailed profiles of their users.” “Hide from Google”, Wired How-to Wiki (Wiki), Last modified: February 3, 2012 (10:30 p.m.) http://howto.wired.com/wiki/Hide_From_Google
  • 63. Google Books and the Digital Humanities  Battle between the science and the humanities (“Two Cultures” – C.P. Snow) taken to the next level…  If “culturonomics” gains more and more of a foothold, on what basis will agreements and disagreements in the humanities increasingly be evaluated?  Will they primarily be evaluated on the basis of who has the better algorithmic methods and scientific methodologies? Or will they primarily be evaluated on the basis of the human interpretation that is the result of many hours of study via real reading? Shakespeare, John Brockman, and C.P. Snow – http ://gloriamundi.blogsome.com/category/science-a
  • 64. Google Books and the Digital Humanities “Justice is a denial of mercy, and mercy is a denial of justice. Only a higher force can reconcile these opposites: wisdom. The problem cannot be solved, but wisdom can transcend it. Similarly, societies need stability and change, tradition and innovation, public interest and private interest, planning and laissez-faire, order and freedom, growth and decay. Everywhere society’s health depends on the simultaneous pursuit of http:// mutually opposed activities or aims. The adoption of en.wikipedia.org/wiki/File:SchumacherSiB200.jpg a final solution means a kind of death sentence for man’s humanity and spells either cruelty or dissolution, generally both… Divergent problems offend the logical mind.” Schumacher, E. F. A Guide for the Perplexed. New York: Harper & Row, 1977, 127.
  • 65. Google Books and the Digital Humanities  “Meaning has an extraordinary multiplicity that cannot be easily captured by the rigidly limited vocabularies of variables and standard methods” – Andrew Abbot.  Remember: with Google Books you can indeed just read the books. Andrew Abbott, “The Traditional Future: A Computational Theory of Library Research” [pre-print])
  • 66. Google Books and the Digital Humanities Better Tools… Google Book search was built to sell ads against. Ronald G. Musto: Google Books has represented to us that its massive digitization project…that would make the digital at least the equivalent…of print. It is, after all, the ‘public good,’ not the ‘public good enough,’ that lies behind all of Google Books' claims for fair-use rights to its digitization schemes.” Musto, Ronald G. "Google Books Mutilates the Printed Past". Chronicle of Higher Education. 55, no. 39 (2009).
  • 67. Google Books and the Digital Humanities More Musto: “…..Within the scholarly and nonprofit realm over the past decade, there have been dozens of digitization projects: some small, some massive, some open-access, some offered by subscription, some successful, more not so. But several things have united them all: a common purpose for the true good of the community, the highest standards of quality in both technology and content, and a deep- seated and long-abiding concern for the curation, and wide dissemination, of our cultural heritage as a living process that goes beyond commodification.” Musto, Ronald G. "Google Books Mutilates the Printed Past". Chronicle of Higher Education. 55, no. 39 (2009).
  • 68. Google Books and the Digital Humanities For example: Text Creation Partnership (TCP), University of Michigan Corpus of Historical American English at BYU.  Manually transcribe OCR scans  TCP: “Structural tagging” allows computer “to see elements of the book such as paragraphs, typeface changes, and chapters”  This metadata allows searches in introductions, summaries, quotations, etc.  OCR cannot detect non-standard typefaces, some foreign languages, even italics. See http://corpus.byu.edu/coha/compare-googleBooks.asp Martin, Shawn. “To Google or Not to Google, That is the Question: Supplementing Google Book Search to Make it More Useful for Scholarship.” Journal of Library Administration 47, no 1-2 (2008): 141-150.
  • 69. Google Books and the Digital Humanities Privacy: might not the commitment that librarians have to user privacy be a “selling point” we should tout – especially as some people grow increasingly concerned about such things? Currently, Google’s new policy notes that it does not collect user data from Google Books to combine with other services, but it is difficult to see why this seemingly arbitrary decision will stand. Policy & Internet: 2, no. 4 (2010). http://www.psocommons.org/policyandinternet/vol2/iss4/art3/ DOI: 10.2202/1944-2866.1072 Law, Ifrah, “EPIC Unlikely to Prevail in Challenge to FTC Stance on Google Privacy,” JDSUPRA (blog), February 24, 2012. http://www.jdsupra.com/post/documentViewer.aspx?fid=29c4c5c1-4eec-4f14-8c9d-7b64a2dc3a87
  • 70. Google Books and the Digital Humanities  “An idea like Google Books represents both all that is wonderful and all that is terrifying about the digital revolution…. A knowledge society needs its information in a fluid, readily accessible and easily navigable form. It also needs diversity, freedom and the chaotic cadence of a million voices that sing their own determined tunes. The question before us is not an easy one. Either way, we will all win and we will all lose.” Desai, Santosh. "Column: Are Books on Google Good for Us?" Financial Express, (Feb 02, 2010) n/a. http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/872800719?accountid=26720 . 
  • 71. Google Books and the Digital Humanities Closing: Remember what Google’s goals are. As Siva Vaidhyanathan reminds us, we are not actually consumers when it comes to Google (those would be its advertisers), but Google’s product. Our interests and attention are what Google utilizes and ultimately sells. In addition to using Google for all that it is worth, we may also want to redirect our interests to some of the others sources I’ve mentioned – and to see their value as well.
  • 72. Select Bibliography (more citations found in endnotes of paper mentioned earlier) Bivens-Tatum, Wayne. “Libraries and the Commodification of Culture, Academic Librarian (blog), February 13, 2012, http://blogs.princeton.edu/librarian/2012/02/libraries-and-the-commodification-of-culture/ Coyle, Karen. “Google Files Motion to Dismiss.” Coyle’s InFormation (blog), December 26, 2011 (2:16 p.m.), http://kcoyle.blogspot.com/2011/12/google-files-motion-to-dismiss.html Desai, Santosh. "Column: Are Books on Google Good for Us?" Financial Express, (Feb 02, 2010) n/a. http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/872800719?accountid=26720. Darnton, Robert. “A Library Without Walls,” NYR Blog (blog), October 4, 2010 (9:20 a.m.), http://www.nybooks.com/blogs/nyrblog/2010/oct/04/library-without-walls/. Darnton, Robert. “Can We Create a National Digital Library?” New York Review of Books, October 28, 2010, http://www.nybooks.com/articles/archives/2010/oct/28/can-we-create-national-digital-library/ Darnton, Robert. “Six Reasons Google Books Failed.” NYR Blog (blog), March 28, 2011, (11:00 a.m.), http://www.nybooks.com/blogs/nyrblog/2011/mar/28/six-reasons-google-books-failed/ Darnton, Robert. "The Library: Three Jeremiads.” New York Review of Books. 57, no. 20 (November 2010): pp. 22-27. Efrati, Amir. “Judge Rejects Google Books Settlement.” Wall Street Journal, Mar. 23, 2011. http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/858106644?accountid=26720 “Google Book Privacy Still a Concern Post GBS,” Open Book Alliance (blog), October 27, 2011. http://www.openbookalliance.org/2011/04/google-book-privacy-still-a-concern-post-gbs/  “Google’s Big Book Case.” Economist, September 3, 2009. http://www.economist.com/node/14363287  Hand, Eric. “Culturonomics: Word Play,” Nature 474, June 17, 2011 (published online): 436-440. http://www.nature.com/news/2011/110617/full/474436a.html , doi:10.1038/474436a  Haven, Cynthia, “Non-consumptive Research? Text Mining? Welcome to the Hotspot of Humanities Research at Stanford” Stanford University News, December 1, 2010. http://news.stanford.edu/news/2010/december/jockers-digitize-texts-120110.html
  • 73. Howard, Jennifeer. "With No Google Books Deal, Libraries Push New Plans for Digital Access." Chronicle Of Higher Education 57, no. 30 (April 2011): A12. Academic Search Premier, EBSCOhost (accessed March 7, 2012)  Jackson, Millie. "Using Metadata to Discover the Buried Treasure in Google Book Search". Journal of Library Administration. 47, no. ½ (2008): 165-173.  Kloha, Jeff, “Words, Words, Words”, Concordia Theology (blog), December 21, 2010, (6:00 a.m.), http://concordiatheology.org/2010/12/words-words-words/  Lessig, Lawrence. “For the Love of Culture.” The New Republic , January 26, 2010 (12:00 am) http://www.tnr.com/print/article/the-love-culture  Martin, Shawn. “To Google or Not to Google, That is the Question: Supplementing Google Book Search to Make it More Useful for Scholarship.” Journal of Library Administration 47, no 1-2 (2008): 141-150  Mattson, Kevin. "Paying the Piper: Is Culture Ever Free?." Dissent (00123846) 58, no. 2 (Spring 2011): 69-73. Academic Search Premier, EBSCOhost (accessed March 9, 2012).  Meyer, Frankie. “Use Google as a resource for hard-to-find books” The Joplin Globe, February 27, 2012, http://www.joplinglobe.com/lifestyles/x2118802287/Frankie-Meyer-Use-Google-as-a-resource-for-hard-to-find-books  Nunberg, Geoffrey. “Counting on Google Books.” Chronicle of Higher Education, December 16, 2010. http://chronicle.com/article/Counting-on-Google-Books/125735/  Nunberg, Geoffrey. “Google Book Search: A Disaster for Scholars.” Chronicle of Higher Education, August 31, 2009. http://chronicle.com/article/Googles-Book-Search-A/48245/  Parry, Marc. “Google Starts Grant Program for Studies of Its Digitized Books.” Chronicle of Higher Education, March 31, 2010. http://chronicle.com/article/Google-Starts-Grant-Program/64891/  Parry, Marc. “The Humanities Go Google.” Chronicle of Higher Education, May 28, 2010. http://chronicle.com/article/The-Humanities-Go-Google/65713/  Pope J.T., and Holley R.P. "Google book search and metadata". Cataloging and Classification Quarterly. 49, no. 1 (2008): 1-13.
  • 74. Schumacher, E. F. A Guide for the Perplexed. New York: Harper & Row, 1977,127.  Singer, Natasha. “Playing Catch-Up in a Digital Library Race.” New York Times, Jan. 8, 2011  Swift, Mike. 2010. "Google Books may Advance Humanities Research, Scholars Say." McClatchy - Tribune News Service, Aug 05, n/a. http://ezproxy.csp.edu/login?url=http://search.proquest.com/docview/737529948?accountid=26720  Thompson, Chris. "The Case Against Google Books; How three East Bay librarians led the revolt against the company's plans to archive all earthly knowledge." East Bay Express (California). October 14: LexisNexis Academic. Web. (accessed March 7, 2012).  “Tome Raider.” Economist, September 3, 2009. http://www.economist.com/node/14376406  Wile, Rob, “Google Books Reveals How Words Have Changed in Popularity Over Time”, Business Insider (blog), January 25, 2012, (8:19 a.m.), http://www.businessinsider.com/charts-google-books-reveals-the-most-popular-words-in-history-2012-1

Hinweis der Redaktion

  1. We’ll talk about Google Books at it pertains to all of these things…
  2. … . but we know, as did Franklin, that historically much knowledge had been the privilege of the few.
  3. Knowledge is power because it allows for freedom….
  4. So who can open the floodgates of knowledge and education to all people that they may thrive and flourish? Who can be the liberator and champion of the people? Is it not Google? (we’ll get to Google Books specifically soon) How is it not Google? For let me define “Google”. Google is not only “organizing the world’s information and making it universally accessible and useful”. It is, practically speaking, the instant gratification of our information needs and wants. It helps us to do what we want… what we think is right… to freely pursue the goals we think we should pursue. Does Google not mean freedom? How can you doubt that it does not? After all, let me tell you about the Google Books project…. (Having fun yet? Surely I jest a bit, but hopefully in the service of making serious points…eventually)
  5. Sir-gay  
  6. They have some 15 million completed already
  7. Google does in a week what we could do in a year… Availability of massive collections of curated materials…
  8. “ read”, “preview”, “snippet” and “no preview” books “ Free Google eBooks” link Most “read” books pre-1923 (in public domain) “ preview” and “snippet” – because of agreement with publisher, or… “ orphans”…. “ no preview” – strict publishers
  9. All involved: librarians, scholars, publishers, authors, techies…..
  10. Robert Darnton, noting how price-gouging academic journals once had been produced “solely in the spirit of free inquiry”, said, “Google’s record suggests that it will not abuse its double-barreled fiscal-legal power… But what will happen if its current leaders sell the company or retire?
  11. Robert Darnton, noting how price-gouging academic journals once had been produced “solely in the spirit of free inquiry”, said, “Google’s record suggests that it will not abuse its double-barreled fiscal-legal power… But what will happen if its current leaders sell the company or retire?
  12. Robert Darnton, noting how price-gouging academic journals once had been produced “solely in the spirit of free inquiry”, said, “Google’s record suggests that it will not abuse its double-barreled fiscal-legal power… But what will happen if its current leaders sell the company or retire?
  13. Hard stuff. I think that this quote can at least give us an idea about how to think about these things
  14. Only good for Google’s reputation….
  15. take into account web search frequency, recent book sales, the number of libraries that hold the title, and how often an older book has been reprinted.
  16. There can be no doubt, that as Millie Jackson says, “the transformation of the way we work as scholars and researchers is tremendous”. How can we not cry out “Freedom!”?
  17. … and vanity searches might feel even more rewarding in GBS : )
  18. Though I confess I have no idea if this is something that figures into the searches of our new discovery tools either…
  19. to teach critical thinking not only about these concepts in history but the changing nature of language itself
  20. to teach critical thinking not only about these concepts in history but the changing nature of language itself
  21. to teach critical thinking not only about these concepts in history but the changing nature of language itself
  22. to teach critical thinking not only about these concepts in history but the changing nature of language itself
  23. OK – we’ll come back to this stuff….
  24. Just in case you’ve remained blissfully ignorant of all this…
  25. Just in case you’ve remained blissfully ignorant of all this…
  26. Hard stuff. I think that this quote can at least give us an idea about how to think about these things
  27. In sum, as Martin puts it, the question is when it is appropriate to use the Google [Books] product for text searching [and] when it is appropriate to use other products for searching”.
  28. It is always a double-edged sword with Google. Hopefully, as we look towards the future, we will continue thinking hard about the best way to work with all the changes that have come our way – and to act as responsibly as we can.