skud: (Default)
skud ([personal profile] skud) wrote2011-02-01 06:30 am

Ebook discussions flying under the radar

Mirrored from Infotropism. You can comment there or here.

A couple of weeks ago, I started seeing a pile of blog discussion about ebook piracy. It all started on January 12th, when Australian fan lucyham tweeted to author Sarah Rees Brennan:

Apologies. Have just torrented The Demon’s Lexicon. Will buy when laggardly, pickpocketing, luddite publishers in Aus get around to allowing Australians to buy books off Amazon. So sick of “Aussies can’t purchase this book” message.

A twitter storm ensued, which quickly made it to various blogs. Author Saundra Mitchell posted on her blog and also on her LiveJournal, saying:

If even HALF of those people who downloaded my book that week had bought it, I would have hit the New York Times Bestseller list. If the 800+ downloads a week of my book were only HALF converted into sales, I would earn out in one more month. But I’m never going to earn out. And my book is never going to be available in your $region, not for lack of trying.

Things took off quickly; there were almost 20 posts that day, mostly in response to Mitchell. In the next two weeks, or a little more, almost a hundred posts on the subject sprang up, more than two-thirds of which were on LiveJournal or Dreamwidth (a LiveJournal-like site, based on the same code; many people crosspost between the two.)

Reading these posts as they flowed past, I noticed several interesting things about them. Firstly, many of them were addressing the issues of ebook piracy from an angle I had never seen before, criticising the capitalist structures of book distribution and intellectual property from a social justice perspective. Secondly, most of the posts seemed to be by women. Thirdly, nobody outside the circles of LJ/DW fandom and social justice circles seemed to be noticing. It seemed a pity. I’m a regular reader or visitor to many tech blogs, including O’Reilly Radar, TechCrunch, ReadWriteWeb, and pop-culture-meets-tech blogs like Boing Boing, any of which might have picked up this story and run with it, if they’d been in the loop.

I wondered, though, whether I was missing something. I know that we tend to follow people most like ourselves online, and read things that reinforce our own views and opinions. If men disproportionately follow other men, maybe I’m disproportionately reading posts by women, and there were a whole bunch of posts by men that I’d missed. It’s happened before, after all.

Tonight I decided to investigate. Using Google Blog Search and following links from any posts I found, I put together a spreadsheet of posts, 112 in all, on the subjects of ebook piracy and international distribution, between Jan 10th and Jan 27th. (Why Jan 10th rather than 12th? Turns out that O’Reilly Radar had posted an article about ebook piracy and DRM on the 10th, which was referenced by other bloggers over the next few days, so it seemed worth including. More notes on my methods and choices made are at the bottom of this post.)

I then took the spreadsheet and ran it through a few lines of Perl to generate the following GraphViz graph:

ebooks discussion (600px)

Visualisation of the ebooks discussion. Version 1.0, 2011-02-01. Full image: SVG, PNG


  • A <- B means that post B linked to post A
  • Gender of poster is shown by colour of the nodes (pink for female, blue for male, grey for unknown/other)

Here’s what’s going on.

  • The giant tangled blob taking up most of the image is the discussion sparked by lucyham’s illegal download of Sarah Rees Brennan’s book, and Saundra Mitchell’s subsequent blog post. As you can see, there are the best part of 100 posts, mostly by women. This discussion ran from at least the 12th to the 27th of January (and the post you’re reading right now extends it into February).
  • At the top of the chart are some small clusters showing conversations not connected to the main LJ/DW conversation. The first is small cluster mostly around posts by Chris Walker of, criticising publishers who don’t make their books available to Australian consumers. This discussion ran from at least Jan 11th to 25th, but never crossed over with the discussion sparked by lucyham (also Australian).
  • Just below that is a set of posts about ebook piracy and DRM circumvention, mostly centred around O’Reilly Radar’s interview with Brian O’Leary, who says that DRM doesn’t prevent piracy. O’Reilly run the Tools of Change for Publishing conference, and that blog post was part of the lead-up to that event. This discussion played out from the 10th to the 18th of January, without linking to any of the other discussions in progress. (Nor, to be fair, being linked from them; the community discussing ebooks in January was as unaware of O’Reilly as O’Reilly were of them.)
  • At the extreme top right, a single post by Mike Shatzkin, on the globalisation of ebook publishing, was posted on Jan 21st but doesn’t link to any of the aforementioned conversations. It’s included purely because of its topicality, even though it wasn’t connected to anything else.

I’m not sure what conclusions to draw from this, other than that my gut feeling was correct: there was a really fascinating, complicated, crunchy conversation going on, mostly among women, mostly on LJ/DW, that the tech blogs and other parts of the web don’t seem to have noticed. Make of that what you will.

If you missed the discussion and would like to catch up on some of the highlights, I would recommend:

Links to all ~100 posts are in the spreadsheet o’ doom.

Some notes on methodology

  • How I found posts: a) Google Blog Search for “ebook piracy” between Jan 10th and 31st, b) following links from any blog posts that showed up, c) “linkspam” posts, especially this one. (The LJ/DW community, especially those involved in fandom and social justice issues, have a tradition of gathering linkspams to document hot issues as they happen.) I followed up with additional blog searches for related phrases, for instance “saundra mitchell”, or “brian o’leary” to find more posts linking to his O’Reilly Radar interview.
  • What posts qualified: any post that had at least a paragraph of original opinion, or was linked to by any other post already in the set. Therefore, posts that simply linked without additional commentary didn’t count. Posts outside of the time period (Jan 10th to 31st) were not included, no matter how often linked.
  • What counted as a link: only links in the main body of the post; I didn’t look at links in comments. However, if there was a link to a comment thread, I counted that as a link to the parent post.
  • Gender: I based gender on the stated gender of the poster (in user profiles, self-referential pronouns, etc), given names, pictures of the poster, and/or circumstantial evidence, in roughly that order of preference. In many cases, I have met the poster in person, and so used my personal knowledge.
  • Proportion of LJ/DW posts: when calculating the number of LJ/DW posts in the conversation, I counted a post if it was crossposted to at least on LJ-like platform.

Please, if I missed anything, mis-categorised, or mis-gendered anyone, let me know, and I’ll update the spreadsheet and the generated graphics.

elf: Computer chip with location dot (You Are Here)

[personal profile] elf 2011-02-01 09:38 pm (UTC)(link)
Random comment: When I do link roundups, I get more useful results with Icerocket's blog search than Google; it collects more LJ links. (It doesn't collect many DW links, but I think that's because more DW people have spidering turned off.)

And searching for "e-book piracy" got different results, because some people don't use the word "ebook."
elf: Computer chip with location dot (You Are Here)

[personal profile] elf 2011-02-01 10:02 pm (UTC)(link)
Yeah, if you got links from my roundup, I'd already collected everything I could find at Icerocket, Google, LJ's search & DW's search, with "ebook" and "e-book" (at DW, I just searched for those alone, and for "piracy" alone, because the results were small enough to not mind the false positives), "book piracy," "book pirate," and "author piracy."

Searching for "ebook" alone at LJ was ... informative. I learned how active the spam blogs are.

Google apparently weights their results by some kind of blog popularity system that excludes most of the "journal" blogs.
foxfirefey: A wee rat holds a paw to its mouth. Oh, the shock! (myword)

[personal profile] foxfirefey 2011-02-02 12:49 am (UTC)(link)
Actually, Icerocket, as far as I know, either doesn't check the no-spidering rules or caches them so aggressively that people who have turned off spidering a while ago will still show up. I've had to make sure when using it to NEVER jump in with a suggestion without first checking the robots status of a journal, so I don't end up being a Creepy McCreeperton.
elf: Computer chip with location dot (You Are Here)

[personal profile] elf 2011-02-02 01:17 am (UTC)(link)
How do you check the robots status of a journal? (Am nongeek; had no idea that humans could check this.)

I don't think it's likely to change my habits for linkspam roundups; I also use less-automated methods like "any post with lots of comments, I check out that person's f'list for related posts." And I use the search at DW, which has opt-in for site-only searches that aren't caught by Icerocket.

I also get a lot of Icerocket results that point LJ but not the crossposted DW posts; I wonder if it's LJ that plays havoc with people's search settings.
foxfirefey: A guy looking ridiculous by doing a fashionable posing with a mouse, slinging the cord over his shoulders. (geek)

[personal profile] foxfirefey 2011-02-02 01:22 am (UTC)(link)
It's just a meta tag in the head section of the source, so while it's not apparent if you're just looking at the page in your browser, it's easy to find.

If you use Firefox, Tools -> Page Info will show it to you nice and simple in the Meta section. Basically, if there's a robots tag and it has noindex, nofollow, noarchive...Icerocket is being naughty!

I don't think LJ is playing havoc, I think Icerocket is being awkward and not respecting settings, and doesn't pay much attention to Dreamwidth.
foxfirefey: A guy looking ridiculous by doing a fashionable posing with a mouse, slinging the cord over his shoulders. (geek)

[personal profile] foxfirefey 2011-02-02 03:36 am (UTC)(link)
Oh, true! I just use that as a shorthand on an LJ entry, since if it's in the head tags, they also have it in the root directory--it's all controlled by the same setting.
nineveh_uk: Picture of a wild rabbit with text "I hope your rabbit dies" (Default)

[personal profile] nineveh_uk 2011-02-03 06:31 pm (UTC)(link)
Random thanks for icerocket, which I had never heard of.