Hiding culture: Google Books & Snippets

In May 2006 I was overjoyed with Google books. I retold an anecdote where I was able to find a book after watching a documentary

[It] was mentioned briefly in a documentary tonight and it sparked my curiousity. So I looked for the book, searching the online databases of second hand bookstores. No luck. Then, almost as a joke, I googled it. And there it was on google books. Cool but it was not like I was going to read it online. Then I saw the download button. Within minutes of hearing of the book for the first time I had a pdf of it on my computer – Google books is too cool!

Amazing, fantastic, brilliant… but. There is a tendency to forget that Google is not a neutral infrastructure and therefore has no real desire to preserve and make available books – even when they have scanned them.

Dingodog led me to this problem via the PD-discussion list

Googlebooks has scanned tons of PUBLIC DOMAIN BOOKS, but not all PUBLIC
DOMAIN BOOKS are accessible

there are a tons of PUBLIC DOMAIN BOOKS that Goglebooks non only has
left in snippet view, but that refuses to UNLOCK for full view (and they
are in PUBLIC DOMAIN!!!)

I have a sad story to tell, about these public domain books left in
snippet view:

since 2009 I’m complaining about a same set of PUBLIC DOMAIN BOOKS whose
copyright expired since more than 10-20 years

I sent a mail every month, then every week, and all this during last
three years to report the erroneous classification of public domain
books as snippet view. Now I’m considering to send a mail every day, but
I’m not confident about effects; I think Googlebooks will ignore my
complaints, as it has ignored during these years

it seems, in fact, that Googlebooks has absolutely no will to unlock,
even if user (as I have done) provides well documented biographical info
and cites the laws regulating the status of book in different countries

People complained about public domain books left in snippet view and
googlebooks user forum was full of these complaints with google
employees not only unable to unlock (maybe google not provided this
ability to employees, or it simply ignored requests), but seriously
lacking in knowledge of googlebooks structure

The discussion continues on the list but it is terribly important to know that scanning is not preservation and does not mean access. Additionally when Google makes these choices it is increasingly important to know this.

Another question I find interesting is the question of multiple copies. Will Google care enough to make multiple scanned copies available? Will we be able to see the errors and additions in certain volumes or not?

Sure the originals are still around but the problem is that with the convenience of Google people will forget this and focus on what is available online. Also the availability of Google books will prevent more rigorous projects from being carried out.

The Extroverted Reader: Notes from a lecture

Actually the lecture was called “The Extroverted Reader” and looked at the ways in which ebook readers are changing the ways in which we consume culture.

Beginning with a bit of history: The technology of writing began about 5000 years ago (Unfortunately in my slides I’m off by a millennium) by the Sumerians. By 2000 BC the Phoenicians had a form of writing – but it did not contain many of the elements we rely upon today:

fndtlvsnvrydctngvrytmsmbdytrnsnthstgntththrrmndrdbk

For example, 1000 years late the Greeks had added vowels

ifindtelevisionveryeducatingeverytimesomebodyturnsonthesetigointotheotherroomandreadabook

and the plays of Aristophanes (446 BC – ca. 386 BC) had punctuation

ifindtelevisionveryeducating.everytimesomebodyturnsontheset,igointotheotherroomandreadabook.

Mixing lower and upper case appeared 700 AD

Ifindtelevisionveryeducating.Everytimesomebodyturnsontheset,Igointotheotherroomandreadabook.

and the humble spaces between words seems to have been developed in Ireland in 900 AD

I find television very educating. Every time somebody turns on the set, I go into the other room and read a book.

But since then not much has happened. Sure we have changes in materials, production and business models. But the printing press was not an innovation in text – it was an amazing innovation in lowering production costs. So basically for the next millennium not much happened.

It was not until we began to go digital that we realized that we had the potential to fundamentally change the way in which we read. But things did not come overnight and it was not until the 1970s that we cracked electronic paper. This development was fundamental to the development of the ebook reader. The next challenge is to find a point at which to start looking at the developments in the field. Here is my timeline: 1993 Apple Newton, 1999 Franklin EB-500 Rocket eBook, 2002 TabletPC, 2004 Sony Libré, 2006 eReader PRS-500, 2007 iPhone, 2007 Kindle, 2009 Nook, 2010 iPad, 2011 Kindle fire.

This was followed by a brief section on the control of media – the ways in which books could be controlled in the past in relation to how they can be controlled in the present. What you can and cannot read depends on those who control the technology of reading. Prior to ebooks this control was a question of distribution. Here we can see examples of the requirement of censors to permit the printing of books through the system of imprimatur or the attempts to create lists of forbidden books such as the Index Librorum Prohibitorum of the Catholic Church. These analogue controls have their strengths and weaknesses but they are naturally imperfect controls.

No matter how effective the controls were, they were no match to the control demonstrated by Amazon when it remotely deleted some digital editions of the George Orwell’s 1984 from the Kindle devices of readers who had bought them (NYT, June 2009). This act shined a clear light on one of the fundamental questions of ebooks – what is it we have actually bought when we buy an ebook? What do we own or have a right to use? Is the content of our reader ours?

This area is fascinating but what my talk was going to focus on was the issue of connectivity in relation to the reader so I moved along to the growth of connectivity in reading. Reading is always a social activity, in its most basic form the reader is connecting with the writer. We are also connected, in some form, to others who have read the same material as us. By reading similar works we create a common culture and understanding. Our common experiences enable us to have a common starting point in many discussions. This is true of all cultural expressions. Today saying things like double-dip or tie-fighter evoke common ideas and shared experiences but what would these words have meant to someone in 1970?

Today sharing is all the vogue and the technology of choice is social media. There are naturally critics to our new behaviors. Some critics see the end of human culture (Keen 2007) to the rewiring of the physical brain. For the latter I like to use Professor Greenfield who has been quoted as saying:

“My fear is that these technologies are infantilising the brain into the state of small children who are attracted by buzzing noises and bright lights, who have a small attention span and who live for the moment.”

There is a general criticism that we are using our devices to ensure that we have a constant stream stimuli and the fear is that this will prevent us from having “real” experiences. My take on this is that we are losing certain aspects and gaining others… for example I have written about the negative side of the loss of real boredom in our lives.

By connecting social media and reading we are attempting to ensure that the reader is not unconnected from the rest of the world. One part of this is the highlighting function in readers. First the highlighter is a mimic of analogue technology. We need to be able to highlight sections of text in order to find them again. But this is quickly used in new and exciting ways. First we can share our markings so that while reading you can be informed that 3 readers have highlighted this section. This effects our reading, we want to be accepted by others and not to stand out – so we look more carefully at these sections. Secondly, if we highlight a section this information is passed along but it may not be enough- we are asked whether we would like to share what we are reading. Why? Its all part of the development of performance lifestyle. Of course we want to share our deepest browsing, we need to show that we are extraordinary in some fashion.

All this data is gathered and analyzed. As is the data of which books you buy, when you buy them, when you start reading them, when you stop reading them, where in the book you pause or start and if you actually finish the book. Sure, you may actually have thick books in your bookshelf but in the future your bookseller will know how long it took you to read them – if you ever did!

This is the interesting thing. While we are buying our books we are also taking part in a much larger process where we are providing information about our deepest and most solitary habits. Someone is really reading over your shoulder but they don’t want you content – they want your habits.

The next section looked at changes in the marketplace as the ways in which we read will naturally change the ways in which we create and sell books. Amazon already knows what you browse and what you do, or do not buy, they allow us to write reviews and to use functions such as Facebook’s Like button. Not to mention the ways in which they are using interesting varying pricing strategies to get use to impulse buy. Buying is easier and does not require physical activity or waiting for delivery. This increases the content we have available to us.

Our content is swelled even further buy alternative book markets such as self publishing projects or Project Gutenberg which has 36 000 books available to us. Stop and think about that number! That’s a huge amount of books. Add to this the pirated books which can be downloaded illegally. These alternatives provide any reader with an endless supply of books. Endless if they are intended to be read as well as downloaded.

So in closing I wanted to address the point of the lecture: What will the endless library do to our individual reading patterns, to our collective cultures, to our language, to our libraries?

– Access to endless amounts of books will change the ways in which we read. We will demand more for less from our authors. Readers will generally have less tolerance for the slow read and will want more bangs for their bucks. Writers wanting to achieve large scale fame will have to adapt to this. Publishers will demand they do. Publishers will also know (based from reader data) where and when readers stopped reading and will attempt to “fix” this.

– Our culture will no longer be defined by a common canon of literature but we will become more splintered into interests. Naturally we will still be dominated by the bestsellers but below that we will all read our own interests in a way that we have not seen in books (but we have seen this in magazines and music).

– Many non-English publishers have been attempting to retain control over their markets by excluding or limiting the ebook from their languages. But this is not a long-term solution as self-publishing will force them to change. If not there is the possibility that the smaller languages will suffer (maybe disappear?), especially in the countries where English skills are good.

– Our libraries are often seen by outsiders as bastions of conservatism. This is very much the outsiders view. Librarians are the first to adopt and change. They do not see themselves as repositories for physical books but as place of information exchange. This will continue to develop but it is important that the image of the library as the silent, dusty pile of books and the librarian as the old spinster must change in order for librarians to succeeded in their metamorphosis.

One thing is certain. Culture is inevitable even if copyright is not. Technology will not kill our culture even if the business models which we are used to seeing today may not be around tomorrow. The reader will remain even if trees are no longer killed to feed her habits.

The slides for my presentation are online here.

What are you writing for #AcBoWriMo

Following the good example of Emily and the Lime presenting what we are writing for AcBoWriMo.

My project is to write a book about Online Identity (this is a very crappy working title). Since finishing my PhD (effects of technology on regulation of Democracy) I have had several ideas for longer works. All of these ideas have crashed and burned due to lack of time and other good excuses. So when I came across AcBoWriMo at the same time I had yet another idea for a book it was time to jump in with both feet.

The idea of the book is the way’s in which technological change are forcing changing attitudes to the concept of identity. In particular I will look at the ways in which regulation and protection of elements of identity are being affected by these changes. The fundamental idea is that we have previously agreed upon loosely defined and understood ideas of identity and their protection but these ideas and protections are being challenged (blown away almost) by the ways in which we use technology. The book will show the ways in which regulation fails and attempt to describe why this failure occurs. This is not really as clear as it should be but I am right now not focusing on defining the overarching idea of the whole book but building it from the bottom up with each chapter exploring different (though naturally related) changes.

For me the project began with a mindmap – (ugly version below) – and will in 18 working days reach 36 000 words in November and somewhere between 80 ooo – 100 000 words by mid January. For me AcBoWriMo is a welcome kick up the backside in forcing the launch of the writing project (no more excuses) and a pleasurable way in working alongside others – is this a form of misery loves company?

So what are you writing?

The Martini Method

Writing can be hard, boring, lonely work. We need all the help we can get. I just came across the Martini Method (via Academic Productivity) and feel instantaneously its my kind of carrot and whip!

What I call the Martini Method is named after an anecdote I once read about the novelist Anthony Burgess (of Clockwork Orange fame). Burgess was a very productive writer, which is attributed to a system where he would force himself to write a 1000 words a day, 365 days a year. When he had completed his word count, he would relax with a dry martini, and enjoy the rest of the day with an easy conscience, and normally in bar. A friend of mine’s version of the Martini Method was to come into the office everyday, and not allow herself to leave until her word target had been reached. Most days she left before 5pm, though on occasion she would stay as late as 6 or 7. She would also set herself mini Martinis, such as allowing herself an ice cream in the summer once she had hit half her daily word count. Though we started at the same time, she finished her PhD a lot earlier than me!

Peter Langmar's Cultural Sphere and Public Interest

Reading and enjoying Peter Langmar‘s masters thesis Cultural Sphere and Public Interest: Combining Free and Participatory Culture, Cultural Democracy and Critiques of Value Regimes to Rethink Policy, Artistic and Institutional Practices

Here is part of the abstract:

The thesis aims at a holistic and multidisciplinary redefinition of public interest in the cultural sphere, contextualised in the democratic and cosmopolitan era. The thesis reveals various problems and weaknesses of the cultural sphere by combining a wide variety of concepts and discourses such as critiques of: high and mass culture, aesthetics, monopolistic competition, hegemonic value and copyrights regimes. In other words the thesis merges the critiques of the oligopolistic actors, of the hegemonic copyright and value regimes of the cultural sphere.

Peter’s work is theoretically a bit heavy – it even includes a section on future research in the form of a PhD research proposal – but its highly readable and is well worth the time. So far I am most pleased with the ways in which he connects free culture, cultural democracy and participatory culture.

As it’s licensed under Creative Commons BY NC SA I have added it to the texts of interests in my growing open licensed collection of works of interest.

Wikipedia Reader: new free book

Another book has been added to my growing hoard of CC licensed works that are somehow relevant to my research area.

The Critical Point of View: A Wikipedia Reader is an interesting work featuring research from a large group of exciting and original thinkers. It is, as the blurb states:

About the book: For millions of internet users around the globe, the search for new knowledge begins with Wikipedia. The encyclopedia’s rapid rise, novel organization, and freely offered content have been marveled at and denounced by a host of commentators. Critical Point of View moves beyond unflagging praise, well-worn facts, and questions about its reliability and accuracy, to unveil the complex, messy, and controversial realities of a distributed knowledge platform.

Right now the chapters which have my interest are

The Argument Engine by Joseph Reagle, What is an Encyclopedia? From Pliny to Wikipedia by Dan O’Sullivan
A Brief History of the Internet from the 15th to the 18th Century by Lawrence Liang, Questioning Wikipedia by Nicholas Carr, The Missing Wikipedians by Heather Ford, and The Right to Fork: A Historical Survey of De/centralization in Wikipedia by Andrew Famiglietti. But this is only a small fraction of the topics covered in this work.

So check out: Geert Lovink and Nathaniel Tkacz (eds), Critical Point of View: A Wikpedia Reader, Amsterdam: Institute of Network Cultures, 2011. Its available in online, pdf, or good old dead tree versions!

Also if there are other titles of CC licensed books which should be included in the list please let me know…

The end of Swedish?

Today I did something unusual… I bought a book! Well the book in itself is not unusual but what was different today was the fact that the book was old fashioned analogue – you know… re-used, old dead trees.

When it was launched I was anti-Kindle, in November 2007 I even wrote:

For me it doesn’t matter how fancy schmancy the details are – and Kindle has some impressive details. The dead tree with ink stains still remains my clear favorite.

But eventually I succumbed and bought one by the end of 2010. Almost immediately my reading and purchasing patterns changed drastically – this became very obvious when the book Själens medium: Skrift och subjekt i Nordeuropa omkring 1500 by Götselius was not available in digital format… and did not buy it!

Most of the time this is not a problem as most of my reading is in English. But this has an interesting side-effect: publishers in small language groups seem to think that staying out of the Kindle market is a smart way of maintaining control over their market. But the problem is that this market is diminishing. Given a choice – the Kindle user is almost forced into the a larger language group.

Sure, I was forced to buy a book today but that’s still 15 less than I would have during the same period.

Needed to be said

If you can keep your head when all about you
Are losing theirs and blaming it on you,
If you can trust yourself when all men doubt you,
But make allowance for their doubting too;
If you can wait and not be tired by waiting,
Or being lied about, don’t deal in lies,
Or being hated, don’t give way to hating,
And yet don’t look too good, nor talk too wise:

If you can dream – and not make dreams your master;
If you can think – and not make thoughts your aim;
If you can meet with Triumph and Disaster
And treat those two impostors just the same;
If you can bear to hear the truth you’ve spoken
Twisted by knaves to make a trap for fools,
Or watch the things you gave your life to, broken,
And stoop and build ’em up with worn-out tools:

If you can make one heap of all your winnings
And risk it on one turn of pitch-and-toss,
And lose, and start again at your beginnings
And never breathe a word about your loss;
If you can force your heart and nerve and sinew
To serve your turn long after they are gone,
And so hold on when there is nothing in you
Except the Will which says to them: ‘Hold on!’

If you can talk with crowds and keep your virtue,
‘ Or walk with Kings – nor lose the common touch,
if neither foes nor loving friends can hurt you,
If all men count with you, but none too much;
If you can fill the unforgiving minute
With sixty seconds’ worth of distance run,
Yours is the Earth and everything that’s in it,
And – which is more – you’ll be a Man, my son!

If by Rudyard Kipling & It just needed to be said right now.

What libraries should protect

In the digital age the idea of being concerned about someone knowing which books we read may seem strange. But as a matter of principle I feel it should be important that this kind of information is not saved. Very often we hear the argument: if you have done nothing wrong, then you have nothing to hide.

That argument is so stupid that its incredible. It shows that a pithy statement will turn peoples intelligence off. Think about any persecuted minority and then repeat the statement one more time. I dare you.

So back to the library.

The books I read can tell you a lot about me. But the problem is that you cannot know what books I read and how they have impacted my life from a list of books I have borrowed from the library. This list will tell you nothing about whether I read them, how I understood them, if I read them to criticize or to admire… or just to impress someone else. All you know is that I have borrowed them. Unfortunately, in times of stress, such data will be used as “proof” of something. And not only in times of stress.

In January this year the Swedish Justitiekanslern (Chancellor of Justice) found that the university library in Göteborg (my uni) was not wrong to save data on borrowed books and the borrower even after the books were returned. (case 2356-09-42: Personsuppgiftslagen (1998:204) är inte tillämplig på personuppgifter i ett låntagarregister som förs fortlöpande vid ett universitetsbibliotek decided 2011-01-17

Their reasoning is that the information about the books and borrower fall under the well established Offentlighetsprincipen (principle of public access) and would be saved – and made accessible to anyone who wants it. Information that falls under Offentlighetsprincipen may be removed from the archives under certain conditions.

In the case of the books individual borrowers have borrowed this data is removed two years after the library card expires.

Since I have had a library card at my library since 1997 or maybe even earlier all the books I have every borrowed from my university library are a matter of public record and can be extracted by anyone.

So I am dismayed, but not surprised, by the outcome of the decision by the Chancellor of Justice. But what really gets me annoyed is the attitude of the libraries. This is not the kind of data they should be collecting. This is not the attitude they should be having towards their readers. Their behavior does not promote openness, but rather will decrease the likelihood of people reading “suspect” material – whatever that may be. I thought libraries were all about open mindedness and learning. Now I am sure that what they are doing is convenient for them – and we have come to expect companies selling the souls of their employees and customers for their convenience.

But libraries? For shame.