The Case of City of Bits
In 1995 I had a chance to explore these questions in a practical
context when, with the MIT Press, I published my book City
of Bits. Since it dealt with the digital revolution and the
new relationships that were being created between the material and
virtual worlds, we decided that it should be self-exemplifying-that it
should appear simultaneously as a hardback (3) and in a full-text World
Wide Web version. (4) As far
as I know, it was the first book to be published simultaneously in
print and on the Web. (At the very least, it could not have had many
predecessors.)
We made the marketing people happy by providing a link to an
online order form
from the opening screen of the Web site; enter your name
and address, include your credit card number (in a secure transaction),
click to transmit your order, and a copy gets sent to you immediately. Conversely,
we published the URL (the address in cyberspace) of the Web version on the
dustjacket of the print version. So a reader of either one could always
conveniently obtain the other.
We provided free access to the Web version. (As the Web develops, convenient
mechanisms for charging for access to online material are being put in place,
and these will obviously be crucial to the development of an online publishing
industry. But these were not highly developed when we put City of Bits
online, and attempting to charge just didn't seem worth the trouble at that
point. (5)) There was some risk
in this, of course; why would anyone buy a copy when the online version
was right there at no cost? Perhaps we would lose sales. But we guessed
that the additional sales generated by the Web site would outweigh such
losses, and there is some good evidence that we were right; in the first
two printings, about 2% of the total sales were directly through the online
order form, and it is likely that the Web site also stimulated bookstore
and mail-order sales.
Why should this be so? The answer is that the hardback and online versions
added value to the text in different and complementary fashions. (The dimensions
of that complementarity will be explored in the discussion that follows.)
So readers of the Web version are not necessarily potential customers for
the hardback. And lots of people decided that they wanted both, to use in
different ways.
Hardback, Paperback, and No-back
Of course, publishing a book in different versions is not a new idea; it
has long been a common strategy to put out both hardbacks and paperbacks.
The hardback is more expensive and more robust, and it is aimed at libraries
and at buyers who want to keep it permanently on their bookshelves, while
the paperback is cheaper is not designed to have such a long life. Depending
on the content and the marketing strategy for a particular book, it may
appear in hardback only, in paperback only, in paperback with a small number
of hardbacks for sale to libraries, or in hardback followed by a less expensive
paperback at a later point.
With the Web, the online no-back emerges as a third option at the inexpensive
and ephemeral end of the spectrum. It can be used, even by very small publishers,
to achieve instant world-wide distribution; certainly we found, with City
of Bits, that it was quite widely read and even reviewed in some countries
long before copies of the hardback were available there. But, since publishers
generally have not begun to guarantee the permanent existence of Web sites,
you still need a hardback copy if you want to be sure of continued access
in the future.
You may also want a well designed, well produced print version for ease
of extended reading, portability, and just the sheer pleasure of it. By
comparison with even the very best laptop computer, a well made book is
light, tough (you can drop a book without damaging it, but not a laptop),
comfortable in the hand, and usable anywhere. It has an extremely high-contrast,
high-resolution display, and the access mechanism (turning pages) is a lot
nicer than using a mouse and cursor to scroll text down a screen. Indeed,
I have often thought that, if Gutenberg had invented the personal computer
and printed books had not appeared until the 1980s, we would now be hailing
paper and print as a major technological advance!
As forward-looking computer technologists will be quick to point out, things
won't stay this way. Computers will become lighter, less fragile, and more
portable. The quality of displays will improve. Sophisticated home and office
printers will allow production of high quality, personalized print copies
on demand. We may even see the emergence of programmable "smart paper" allowing
development of devices that combine the virtues of the portable computer
and the book. ( 6). But, for the
moment at least, the hardback, the paperback, and the electronic no-back
have significantly different properties and roles.
Getting the Reader's Attention
The first task of a book especially a trade book that's supposed to attract
an audience is to get itself picked up and read. So the hardback City
of Bits has a vivid, colorful dustjacket to catch the reader's attention;
it's carefully designed to stand out on a bookstore display or a library
shelf. When you take it in your hand, you find a brief description and author
biography on the flyleaf. Then you can flip through it to see what's inside.
The Web version clearly had to attract attention in very different ways,
and making sure that it did so was a key to success. Several strategies
were used.
First, a hot-link was made from the entry in the MIT Press's online catalogue to the City of Bits site. So much as bookstore browsers
can pick up a copy of the hardback Web-surfing catalogue browsers can immediately
get their hands on the online version. And the first thing that the online
version presents is a Welcome page with links to a Synopsis,
the author's Home Page, and the Table of Contents. Thus, to provide one
path into the online City of Bits, the metaphor of an "electronic bookstore"
was fairly closely followed.
Hot-links from other Web sites provide a second way in. City of Bits
was quickly listed in many online, classified Internet and Web guides, "Cool
Sites" collections, online newsletters and magazines, home pages of
organizations and individuals who wanted to draw attention to it, and online
reading lists for classes of various kinds. Some of these links were sought
and negotiated by members of the City of Bits WWW team, but many
appeared spontaneously. Most were one-way, from the other site to City
of Bits, but some were reciprocal a fixed "you point to me and I'll
point to you" arrangement. The ultimate effect was to create a very
large, electronic "catchment" to collect potential readers and
efficiently funnel them to the site.
The third strategy for bringing in readers is to attract the attention of
Web search engines. Typically, these engines explore the Web
periodically to create large indexes and directories, then, in response
to users' queries, employ these indexes and directories to provide very
rapid access to the relevant Web sites. They perform their explorations
in a variety of ways-by looking for specified keywords in the titles or
headers of Web documents, by scanning through the documents themselves,
or even by searching other indexes and directories. They are usually pretty
dumb, since they just look for keyword matches. So, to make sure that your
site is not missed by the search engines which have now become very important
tools for finding one's way around the Web you must make sure that the appropriate
descriptors are included in titles and headers, and in the text of the opening
pages. Incidentally, you can reliably attract a lot of attention by scattering
words like "sex" and "nude" through your text but it
may not be the sort of attention that you want!
A fourth possible strategy, which we have not used, is closely analogous
to pinpoint direct-mail marketing. When Web-surfers access your server,
it is technically possible to collect a lot of information about them who
they are, where they are from, what links they followed to get to your site,
what browser they were using, what they looked at, and so on. If you are
prepared to ignore the obvious privacy issues, you can use this information
to target electronic advertising. So, for example, Web-surfers who looked
at MIT Press online catalogue entries for other books on related topics
might get email promoting City of Bits.
Reading Tools and their Effects
In traditional fashion, the hardback version of City of Bits is
a narrative divided into chapters on different sub-topics and it has a table
of contents and an index to guide the reader through the material. This
allows for multiple styles of reading; you can follow a continuous thread
straight through from beginning to end, you can jump immediately to particular
chapters that interest you, you can use the index to find passages on particular
topics, and you can even cruise the index (or the endnotes) to look for
entries that may pique your interest. You can skim quickly or you can read
more slowly and attentively. You may make notes as you go, or you may not.
You may read in strict sequence, or you may jump back and forth.
The physical book is not only a repository of the textual information, but
also a reading tool that allows you to pursue these strategies efficiently,
and gives you context and feedback as you do so. Its size and shape tells
you roughly how much information it contains, and you always know how far
through it you are from the relative thicknesses of the stacks of pages
under your left and right thumbs. The springiness of the paper allows you
to scan quickly by riffling through pages with the book half open, but the
mechanical properties of the binding assure that you can also leave it open,
flat on a desktop, for more extended and careful study. Typography signals
the hierarchy of information by visually distinguishing headings, sub-headings,
and body text. A Table of Contents right at the front, an Index at the very
back, and numbered pages, provide effective search and navigation capabilities.
Endnotes, with numbered references from the text, allow backup information
to be provided without disrupting the flow of the narrative.
The online version provides very different reading tools. Most dramatically,
there is no index; it is replaced by an internal search engine that locates instances of user-entered keywords in the text. From the author's
viewpoint, this eliminates the intellectual drudgery of creating an index.
From the reader's viewpoint, it provides greater freedom; you can search
for anything, and you don't have to rely on the author's judgment about
what was worth including in the index. (I'm told, for example, that many
readers immediately type in their own names to see if they're mentioned
anywhere!)
The hierarchy of information is also handled differently in the online version,
since the screen can only display a limited amount of text at one time,
since current bandwidth constraints make it undesirable to download large
text files to your browser all at once, and since scrolling through a long
segment of text doesn't work nearly as effectively as flipping the pages
of a book. The complete text is organized into a hierarchy of small segments,
with internal hot-links providing the interconnections among them. At the top of the tree is the Table of Contents page providing entry points to each of the chapters. Within each chapter, there is
the introductory section of text followed by hot-links to the subsections
that it contains. Finally, there is the relatively short text
of each subsection. To allow for sequential reading of the narrative, without
having to go up and down the hierarchy, there are "previous" and
"next" hot-links at the end of each subsection.
Endnotes, of course, are handled by hot-links; click on the endnote mark
and you immediately get the corresponding note. (Cross-references within
the text could be handled in a similar way, but there aren't any.) To maintain
consistency with the print version, and continuity with tradition, the notes
are numbered-but, of course, they no longer really have to be, since there's
never any ambiguity about which note relates to which point in the text.
Overall, the reading tools provided with the online version have a very
interesting effect; they privilege the hierarchical structuring of the book's
content and the operation of searching while they make sequentially following
the narrative more cumbersome and difficult. (It's no accident, then, that
CD-ROM and online books that have these sorts of reading tools have tended
to emphasize modular, classified and indexed chunks of content as in encyclopedias
and dictionaries, to provide dense cross-referencing within the material,
and to construct multi-threaded and branching narratives-in other words,
to focus on anything other than long, continuous narrative sequences.) The
hardback, on the other hand, privileges skimming, random jumps back and
forth, and the continuity of the main narrative thread. So it's probably
optimal to read the hardback first, to gain an overview, then to go to the
online version for more detailed study and for ongoing reference.
Fixed-Format and Personalized
Good graphic designers exert very considered and precise control over the
look and feel of a printed book. Certainly this was the case with City
of Bits. The designer, Yasuyo Iguchi chose to set it in Bembo and Meta.
She arranged elements on the various different sorts of pages, and deployed
white space with care. She gave consideration to its size, shape, proportions,
weight, and rigidity. She chose the paper, the cloth for the cover, and
the matte varnish of the jacket so as to create a particular relationship
of feels and textures. All of this matters. It all adds up to something
that has the characteristic look of a MIT Press book, and that signals something
about the product's style, content, and level of sophistication.
But the client-server architecture of the Web does not allow a designer
such precise control of the online version; it may be downloaded to many
different types of display devices, by many different types of browsers,
with many different settings of their various options, to produce screen
displays that vary enormously. This can be seen as a disadvantage (and typically
is by graphic designers, who don't like the loss of control), and the producers
of Web servers and browsers can try to eliminate as many sources of unwanted
variation as possible. Or it can be seen as an advantage-opening up the
possibility of adapting content intelligently to different contexts and
to the needs of different readers; perhaps every reader of City of Bits
could have a uniquely personalized version. ( 7).
The issue of producer-control versus user-personalization is a philosophical
rather than a technical one; it is technically feasible to implement systems
that support either one or both, and to design online productions that either
go for a consistent look or encourage personalization. In the online version
of City of Bits, we tried to exert as much control as possible to
assure a reasonably high level of graphic quality, to remain consistent
with the print version, and just to keep things simple for ourselves. But,
as personalization tools become increasingly sophisticated, it will become
more interesting to try to take advantage of them.
External Hot-Links
Perhaps the most obvious and striking difference between the hardback and
the online version is that the text of the online version contains hundreds
of hot-links to other Web sites with relevant information on the topics that are discussed. When I discuss online shopping malls, for
example, you can just click to go and visit one. And, when I refer to Aristotle's Politics, you can immediately access the relevant passage, online,
in either English or Greek. Thus the City of Bits site becomes
a conveniently organized entry point for exploring an enormous quantity
of related information.
Some of these external hot-links are to sites that I or my research assistant
discovered and consulted when City of Bits was being written, but
the vast majority have resulted from systematically going through the text,
picking out key words, and sending search engines out on the Web to find
what was out there. Whenever a search engine discovers a relevant site,
we link it in. (You can think of this as a new form of bricollage.)
This process has to be repeated at regular intervals, since the Web is growing
explosively, and relevant new sites are continually appearing. So the structure
of intertextual linkages in which City of Bits embeds itself is
a very dynamic thing, and it looked very different, after the site had been
up for a few months, than it did when it first went online.
The converse process is to combat link-rot by identifying and removing hot-links
to sites that have died, shifted to new locations, or become irrelevant.
(If this is not done, a site quickly loses its charm like an untended garden.)
To facilitate this, we employ a software tool that automatically runs through
the text, checks all the hot-links, and reports all those that don't seem
to be working.
Superficially, adding these links may just seem to be a more convenient
way to provide endnote citations to related publications. But, on closer
inspection, there are some important differences. One is the dynamism that
I have noted; print endnotes can only be updated, all at once, when there
is a reprint or a new edition, but hot-links can be updated incrementally
and at any time. Furthermore, you cannot add too many endnotes to a printed
book without making it bulky and unwieldy, but there is no practical limit
to the number that you can embed in an online text.
But the most important difference is the shift in scholarly responsibility,
and correspondingly in the reader's use of the text, that the substitution
of hot-links for endnote citations entails. Recall that endnote citations
are normally to printed documents that have been formally published and
do not change. A responsible scholar is expected to check the relevance,
quality, and usefulness of a cited document, and to give publication date
and page numbers; scholars who cite irrelevant or poor-quality publications
are not highly regarded. But the author of an online publication cannot
attempt to take the same responsibility, since the contents of an externally
linked site may change unpredictably, at any time; I might, for example,
discover a site containing the text of Aristotle's Politics, check
it out and assure myself that everything was in order, and then make the
link from City of Bits only to discover, some time later, that
the operator of that site had subsequently substituted several hundred pornographic
GIF files for the philosopher's words. So, external hot-links are very useful,
but they have their dangers. Caveat surfer!
As the Web and similar structures mature, there will undoubtedly be an increasing
number of sites providing stable, "guaranteed" content, and scholars
will have less of a problem. There are, for example, already some refereed
online technical journals. ( 8).
But the medium does not automatically enforce document stability in the
way print does, so special institutional arrangements will be needed in
contexts where such stability is necessary.
Marginalia and Readers' Comments
Sometimes readers like to scribble their comments in the margins of printed
books, and sometimes subsequent readers see these comments and may even
add their own responses, but this usually isn't encouraged (particularly
with library books) and it isn't a very effective form of discourse. By
contrast, online versions of books can easily provide for readers to add
their comments, and for these comments to be widely available.
In the online City of Bits, readers can enter an electronic
"agora"
directly from the site's front door, or from the foot of any page of
text. There, they can read the
(comments)
that other readers have posted. They can also use a simple
form
to add their own comments. And they can even insert hot-links to other
sites that they consider
relevant. This agora is organized as a collection of newsgroups, and provides
all the usual features of newsgroup support software.
Over time, then, the online version of City of Bits has become
encrusted with commentary. It has succeeded in provoking, capturing, and
making visible a discourse in a way that is impossible with print. And,
in the process, the seed provided by the original text has grown into a
considerably larger and richer textual structure.
This evolution is fascinating and exciting to see, but it creates some theoretical
connundrums and practical difficulties. The continually growing, transforming
structure is actually the work of many hands, yet it has my name on it.
In the beginning, it was mostly mine, but it becomes less and less so as
time goes on and the online comments accumulate. At what point does it become
inappropriate to say that it is "my" text? When does it become
more reasonable to call it a collective work?
Who bears moral and legal responsibility for it? Should I treat the agora
as a zone in which complete freedom of speech is permitted, or should I,
as the author, take responsibility for actively moderating and shaping the
discussion? Should I delete blatantly irrelevant and self-serving comments?
What if advertisements are posted? What if a reader were to post comments
that I found personally offensive and insulting? (Am I obliged to provide
that person with a platform?) What if a posting were found to contain slanderous
or obscene material, or a neo-Nazi diatribe? These are not the sorts of
questions that arise about scribbled marginal comments in printed books,
but they have been hotly debated in relation to online newsgroups and bulletin
boards. A book becomes a thing of a different kind when it systematically
internalizes and reports back the discussion that it has provoked, rather
than standing distinct, closed, and aloof from it.
These seem difficult questions, and general answers will probably have to
be worked out through experience and debate. In the case of City of
Bits, the team that maintains the site has taken a rigorous "hands
off" attitude; we occasionally go through and clean out the completely
irrelevant postings that sometimes appear, but we leave everything else
there. Generally, comments so far have been serious and responsible, so
we have not been forced to confront any really troublesome dilemmas. Perhaps
we have just been lucky, though.
Reviews, Mentions, and Translations
Any successful book soon generates a growing body of thematically related,
secondary, and derivative texts reviews, commentaries, news articles,
mentions in other works, and translations. The City of Bits site
keeps a running record of this sort of material (to the extent that the
team can keep up with it) and, where possible and appropriate, provides
links to it.
As it turned out, City of Bits generated a lot of interest, and
quickly received many reviews in both the specialist and mainstream media.
Perhaps naively, we had hoped that we might add the full texts of all reviews
to the site as they appeared. That would have made accessible another, extremely
interesting, layer of commentary and elaboration. But the world is not quite
ready for that; after a few attempts to secure permissions to reproduce
complete reviews online, and generally getting rebuffed or asked to pay
exorbitant fees, we retreated to the position of posting short extracts much
as they have traditionally been reproduced in jacket copy and advertisements.
In future, though, it may not be so difficult to achieve our original ambition;
when the majority of reviews appear in online editions of newspapers and
magazines, and the like, it will only be necessary to link to them.
As translation rights have been sold, details on the forthcoming foreign-language
editions have been posted in a Translations section of the site. When the translations are completed, we will explore
further possibilities. (This will require making new and unusual types of
agreements with the overseas publishers, and it is not yet clear how these
will work out.) For example, we might simply add online texts of the foreign-language
versions to the City of Bits site. We might go further, and provide
structures of cross-linkages among the English and foreign-language versions
so that multilingual readers might conveniently move back and forth a particularly
useful capability where words and phrases do not have very exact equivalents
in other languages, or where there might be ambiguity or debate about the
best way to translate things. Or we might encourage the foreign publishers
to develop their own Web sites for the translations, then build links to
and fro. In the more distant future, it is easy to imagine online books
existing as multilingual, geographically distributed sites in which you
are asked, on entry, what language you want to use-as in American Express
cash machines.
Online Appropriation
In effect, the various external linkages from the City of Bits
site appropriate a vast array of existing textual fragments and combine
them to form a new work something that, because of the selection and organization
that goes into it, is significantly greater than the sum of its parts. The
original City of Bits text, as published on paper, is just one
of these constituent fragments though, to be sure, a privileged one. (This
shifts to a radically new context the old idea, recognized in intellectual
property law, that a collection can be a creative work.)
This strategy of textual appropriation and collage does not run into the
sorts of intellectual property difficulties that would arise in creating
a large, cross-referenced print collection, since the constituent fragments
are merely pointed to rather than reproduced. The author of an appropriated
text does not lose anything in this way. On the contrary, authors usually
post texts online because they want them to be noticed and read, so it is
an advantage to attract linkages that might channel readers from other texts
and sites.
In sum, an important new literary role has now emerged that of the link-editor
who locates fragments of text online and combines them into original literary
structures by superimposing patterns of linkages. On a large scale, the
operators of Internet guides like Yahoo! play the link-editor role
by selecting and classifying online material and providing convenient point-and-click
access from a topic list. Pedagogues play the game when they link words
in books and articles to online reference works dictionaries, encyclopedias,
and so on. Critical scholars play it when they create structures of comparisons
and contrasts among texts. The City of Bits team certainly played
it when they constructed the online version. And, by now, the online City
of Bits has been appropriated into a great many online constructions
created by others.
When I have discussed this form of appropriation with other authors, some
of them have been greatly disconcerted by the idea. They do not like the
possibility that their work might be used in ways they cannot control and
for purposes that they never intended. (They forget, of course, that authors
have never really had very much control over the uses and misuses of their
published texts. But embedding in online link structures does make this
possibility dramatically explicit.) Others, including myself, are excited
by being able to see with new clarity the evolving roles that their texts
play in ongoing discourses.
Stabilities and Instabilities
As we have now seen, the online City of Bits has both stable and
unstable elements. The core text, which corresponds to that of the print
version, does not change. But the structure of links that it carries is
continually adjusted and extended, the contents of the externally linked
sites evolve, and the accreted structure of comments, reviews, and translations
grows. If I decide to do new print editions, I expect to add the text of
those to the online version, and to preserve the earlier edition texts as
well. Thus any change in the core text will be carried out in well-defined,
modular increments.
A more radical possibility would be to make continual small changes to the
core text to reflect new developments and to respond immediately to comments
and criticisms. (There is no technical difficulty in doing so.) That way,
the text would be kept constantly up to date; there would be no need to
keep using an increasingly obsolete and unsatisfactory text while waiting
for the right moment to put out a complete new edition. But this would destroy
the logical integrity of references within the overall structure. What if,
for example, a reader's comment refers to a specific paragraph in the core
text and that paragraph is subsequently deleted or significantly altered?
Perhaps the most satisfactory approach would be to preserve successive versions
as incremental changes are made. Some fairly straightforward software could
then automatically relate comments and other linked material to the appropriate
versions. So far, though, we have not had the energy or the disk space for
that.
Whatever the balance between stable and unstable elements, though, you never
read the same text twice. (Heraclitus would have loved it!) Even the internally
stable elements are continually being recontextualized, and so shift in
their meaning, as the huge structure that embeds them transforms itself.
Furthermore-an alarming thought for historians it is quite impossible to
preserve more than a very partial record of the past states of that transforming
structure; it has no distinct boundaries, it is distributed over many different
machines in widely scattered locations, and it is far too large and complex
to back up on tape. The printed book appeared to give scholars stable, repeatable
text modules to work with. Perhaps that was always a myth. With online books,
certainly, that myth is increasingly difficult to sustain.
The End
Hardback and paperback books eventually go out of print. Archival libraries
selectively perform the function of preserving books after that point. But
what about online books? Since it does take some effort and resources to
keep them around, and even more to keep them growing and changing, they
are likely to have quite limited lives. How long do they stay available
online? What is the electronic equivalent of going out of print? Who is
responsible for long-term archiving?
Answers to these questions are likely to vary with the type of book, and
may change over time as online publication grows in importance, but I can
give a provisional answer for City of Bits online. I regard it
as a kind of extended live performance in a vast virtual theater. Eventually,
that performance will end. The site that remains will not instantly disappear,
but will slowly fade away like an abandoned stage-set-as link-rot sets in
and as additions and updates are no longer made. As time goes by, there
will be fewer and fewer visitors.
In the end, the City of Bits will be an electronic ruin. Like Troy, it will
cease to function and to live becoming, instead, part of the archaeology
of cyberspace.