This is the image I'm considering using to overlay my project.
Wednesday, November 13, 2019
Success!
I now have a list of more than 400,000 Orange County property parcels with building permit dates. Now I need to find an affordable way to geocode all those addresses! I'm currently looking at Texas A&M GeoServices--it appears as though they will allow me to do it in batches of 2500 for free if I credit them, which I most certainly will.
Make sure and go here if you want free geoservicing: http://geoservices.tamu.edu/
Make sure and go here if you want free geoservicing: http://geoservices.tamu.edu/
Tuesday, November 5, 2019
Digital Project Progress Report
- Title: Orlando: from 85 people to 300,000
- Abstract - There is a field (shown above) that shows the history of building permits on the property for a significant percentage of Orange County addresses--I have seen it go back as early as 1929, and have only found one that indicated no buildings were associated with the address. The employees at the Orange County Property Appraiser's Office are happy to share with me the data, charging only a nominal fee if it takes substantial clerical time to pull. Armed with a csv or .xlsx file with the addresses and dates, I can use www.gpsvisualizer.com to do a bulk address-to-coordinates conversion, and then transfer all of that processed data into visual eyes for a powerful illustration of the property development of Orlando over the past seventy to one hundred years.
- Principal Data Sources: Orange County Property Records
- Visualization Platform: VisualEyes
- Notes: I almost called this blog post "Getting Earlier than 1994." After an encouraging phone conversation, I sent the following email to an employee at Orange County Property Appraisers (OCPA):
The employee reassured me that I had come to the right place, and if I came by he/she would provide the data I need and I would not even need to pay the $50 special request fee. I was jubilant...and then more than a little bemused last week when I learned that what the employee had for me went back as far as 1994. After about 20 minutes there, I left with the reassurance that they will find where that building permit data is stored, because they could not find it within the data that they were about to give me. I have not heard back from them, and so tomorrow is probably the time when I will have to cut my losses and switch to a modest alternative digital project, their enthusiastic desire to help notwithstanding.
Tuesday, October 29, 2019
Environmental Scan
In 1875, there were fewer than 100 people who called the new
city of Orlando home. Fifty years later, the Old Cheney Highway still could not
accommodate an automobile from one side of Orlando to the other. The first
traffic light was not installed at the intersection of Colonial and Maguire
Boulevard—less than 2 miles from Lake Eola in the heart of downtown—until 1963.
What did the geographic transformation of Orlando look like?
I am interested in this question as a native Floridian who
will never cease to marvel at the rapidity of urbanization since the second
industrial revolution began to push agriculture to the margins with the speed
of a hydraulic piston. In the not too-distant future, I hope to see what it
looks like in the context of race. As Orlando has grown, its white citizens used
legal and extralegal means of segregating the city racially. Since 1965, most
of its citizens seem to this researcher to have made modest progress toward
integration and racial harmonization, but varying degrees of fatigue, reaction,
resentment, and even despair continue to belie the unity implied by the etymology
of the word community.
With that background, I am grateful for three recent works
of digital scholarship, the “Human Population Through Time” animation produced
by the American Museum of Natural History in 2016, Mapping Inequality:
Redlining in New Deal America from 2013, and Renewing Inequality: Urban
Renewal, Family Displacements, and Race 1955-1966 from 2018—the latter two
works come from the exemplary Digital Scholarship Lab at the University of
Richmond.
Human Population
Through Time
This presents the visualization that I am hoping to achieve,
albeit at a vastly superior production quality than I will be able to reach. After
something of a preamble 100,000 year prologue, it combines a line graph of the
human population with a pixel-density map of the major urban areas from the beginning
of the Common Era to present. The American Museum of Natural History’s
production team were a bit curious in their selections of major civilizations
to indicate with an icon superimposed over the map (why Roman and Han Empires,
but not Gupta?), but it is not too distracting. The really impressive element
is their sourcing. Anything at this scale is going to include a lot of
speculation, but they do not trust in one scholar. This is part of the World
Population History Project, and its sources and credits page is immense—more than
739 separate sources.
Mapping Inequality:
Redlining in New Deal America
The New Deal’s Home Owner’s Loan Corporation made incredible
maps that color-coded America’s major urban areas in four grades: Best, Still
Desirable, Definitely Declining, and Hazardous, color-coded (no pun intended)
green, blue, yellow, and red, respectively (hence red-lining). The maps are
becoming notorious, and it will be an indictment on Americans if the work of
the people at the project do not make them more popularly-known and downright
infamous. The following excerpt describing the now-extremely-rough area of Jacksonville
I grew up in crystallizes the casual racism well:
The Panama Garden area and the southern portion of Northwest New Springfield are generally considered more desirable residential property than that in the remainder of this Section. The Section is occupied by 100% whites consisting principally of railroad men, City employees, clerical, and the lower-income salaried workers with annual incomes ranging from $1,000 to $2,500. Seventy percent of the occupants in this Section are home owners and there is no shifting of population [read: the whites are staying put and will not sell their homes to any more affluent blacks who might offer them a good price on their homes].”
Mapping Inequality is super strong in showing
powerful interactivity and model georectification—it’s fantastic.
Renewing
Inequality
I believe this will be the last major historical puzzle
piece that can show me by analogy in Daytona Beach, Tampa, and Miami what almost
certainly was done here in Orlando—the tragic preservation of inequality and segregation
through urban renewal and urban housing developments. I must confess that its
user interface is something of a cautionary tale to me of ambition; there is
just so much data to sift through it is pretty daunting rather than enticing to
this user, and I suspect I am a good bit more savvy than the median digital
humanist. Still, all that information is an absolute Fort Knox of historical
gold—it powerfully illustrates the scale and pervasiveness of the family
displacements of all races in this period.
Links:
Tuesday, October 22, 2019
And Now, I (AS)SERT...
My thesis research
is focused on the peculiarity of the McCoy Air Base (now Orlando International
Airport) when it was an island of racial integration within a sea of Jim Crow
Orlando. The city’s de jure segregation did not end until the mid-1960s,
but McCoy was integrated from its first reopening upon the start of the Korean
War until it closed in 1975—some 15 years of a very traditional, conservative
institution embodying a radically progressive social change.
The question my
digital project hopes to answer is the background—how Orlando developed from
1875 to 1950. To be clear, my research question is far more broad, its chronological
scope will extend into the 21st century. My guiding notion is that context
means a firm grasp on the before and after of the historical subject matter. I
looked at the “Ask a question” and “Search for Answers” parts of Bill Ferster’s
ASSERT model last week, and this week I finish his prescriptive acronym.
Structure
This is by far my biggest
challenge in establishing my proof of concept on my timeline. There are several
thousand individual parcels of land—far too many to scrape from the property
records office, but I have been reassured that The extraordinary requests liaison
for the Orange County Property Records Office is out of the office until next
week, but once I meet with her I’m hoping I’ll be able to establish a
systematic way to get a representative sample of land parcels and their
registration with the city to trace through Orlando’s development as a city.
Ultimately, I hope
to combine the parcel data with federal and state census data, particularly
with reference to race. However, that will be outside the scope of the initial
phase of the project. I will need to compile, or more hopefully/realistically
find, a database of property records that include the date of a parcel’s
earliest registration with the county.
![]() |
| The Holiday Inn on Alafaya and Colonial was constructed in 1989. |
![]() |
| Surprisingly, one of the last businesses before Bithlo was built in 1968. |
Envision the Answer
Ultimately, I see an
animation that shows some 500-1500 land parcels (as many as I can with the time and resource constraints I'm working with) as they were developed over the
past 120 years, from the turn of the 20th century to the present. Once
I have addresses and dates, it will just be a matter of curating the best map—probably
a composite of geological land survey maps from the early-to-mid-twentieth
century—and superimposing it over Google Earth, then using automation software
to plot the points over a span of 240 seconds (4 minutes, derived from 2
seconds per year for 120 years). If the data for subdivisions and other neighborhoods
is unmistakable enough, I might animate road construction as well, but time is unlikely
to permit that, and it might yield a questionable return on investment—wherever
land is being registered with the county, a road has likely been built to that
parcel. The hope is that the animation will provide a compelling visual answer
to the development question of Orlando, and that by going back and forth across
the timeline patterns of settlement and development will be recognizable.
Represent the Visualization
Obviously, I will be
working with a vector map. The aesthetic questions will be aural—what kind of
sound would compliment the data being visualized?—and geometric—what kinds of
polygons or dots will best illustrate urban development and expansion? I’m not
too concerned about the congruence or apprehension principles, as my question
is directly addressing time, and the animation will be playable at variable speed,
a standard feature of VisualEyes and other similar software.
Tell a Story Using
Data
What is exciting
about this stage is that unlike Slave Voyages or many other famous geographic
animations, I really am not sure what story will emerge. The basic fact—the steady
growth over 145 years from a city of fewer than 100 people to nearly 300,000—lends
itself to innumerable narratives and angles. I honestly don’t even know what
geographic features may have drawn people in what directions; it will be super
fascinating to see the convergence of communities around Winter Park, Lake
Eola, and Pine Castle, and perhaps all around Orlando’s 31 lakes. Time, and of
course the data, will tell!
Tuesday, October 15, 2019
ASSERT and Orlando
![]() |
| Orlando, Florida satellite image, and Green Swamp Wilderness Preserve satellite image, showing the same land areas. The images' centers are 40 miles from each other. |
How did what we now call Orlando, Florida transform from former Seminole Indian territory with fewer than one hundred permanent settlements in 1885 to a city of more than 285,000 people in 2019? Of course, the simple answer is a combination of human migration and reproduction, but the more compelling answer is visual: less than five generations ago, Orlando's 114 square miles were all wetlands and some 31 lakes, and now they are a jungle of asphalt, concrete, human residences, businesses, and massive amusement parks. The images above illustrate the change starkly. Green Swamp Wilderness Preserve speckled with lakes is the Orlando of 135 years ago.
The question above is my application of Bill Ferster's Interactive Visualization: Insight Through Inquiry, specifically his ASSERT model:
Ask a question
Search for data
Structure the information
Envision ways to answer
Represent the data
Tell a meaningful story
Ferster's work is a powerful synthesis of more than 500 years of tradecraft in visualization, if one starts with Leonardo da Vinci's Vitruvian Man circa 1490. I must pay tribute to the design excellence of his introductory chapter: somewhere on page 37 my fascination with his historical survey of visualization theory and practice gave way to impatience--is he going to digest all that for his readers into a practical application? It was not even 200 more words of reading after thinking this before he introduced his ASSERT model, and my confidence in it is immeasurably greater having read his stimulating establishment of his bona fide expertise. Bluntly, I would not have the perseverance to read through (or even skim and distill) the score of theoretical works on data visualization that Ferster summarizes, but I am grateful that he did.
Asking my question is easy, but the search for data is where the challenge begins. Based on a initial foray into the historical maps at the Library of Congress and the David Rumsey Historical Map Collection, I am fairly confident that I will be able to start with several dozen maps starting with mid-19th century military surveys and broadly trace out the city's development. Then come the devilish details: what level of detail do I want to shoot for? How comfortable am I with distortions for simplicity of visualization, e.g., animating road development at a constant rate when there were almost certainly delays of various kinds? In the next week or two, I hope to identify my primary sources and the ways I will structure them so as to make them manageable. I'm already envisioning ways I will represent the data, mainly through an animation that connects the historical maps to each other in a transparent way (making the potential distortions recognizable), and then perhaps too ambitiously I hope to provide a zoom feature and an easily adjusted timeline slider.
I've already learned something surprising that will make it into the story: Orlando's population growth is not nearly as much the story of NASA and the Walt Disney Corporation as one would think. Indeed, a line graph of the city's population over time shows no unmistakable spikes in the 1960s and 1970s--just a continuation of a more or less steady natural increase. Hopefully, through the construction of an interactive visualization of the city's growth and development, I will recognize more about the forces that have driven Orlando's birth and transformation geographically over the past 135 years.
Questions for Dr. Ferster:
- What practical visualization software is the best 'gateway drug' for historians who are interested in but intimidated by interactive visualizations?
- Even 19th century positivist historians wanted readers to interact with their texts in meaningful ways--how qualitatively different are modern historians' interactive visualizations from traditional prose-text scholarship?
Tuesday, October 8, 2019
Mapping Segregation in Washington D.C. Review
![]() |
| A typical deed restricted community's racist clause, legally invincible before 1948. |
There is no historian of the modern American South more preeminent than C. Vann Woodward. He took the base metal boosterism of Atlanta journalist Henry Grady’s “New South” and transmuted it into the academic gold of “New South studies.” His Origins of the New South, 1877-1913 started many of the conversations that divide academics more than six decades after its publication, however it is the book that came from a series of 1954 lectures for which Woodward is most remembered. In The Strange Career of Jim Crow, he argues--fairly implacably--that the very nature of the antebellum economy made segregation impossible, and so the militant de jure segregation of the South of his day was a relatively new phenomenon. The races had socially commingled before, and so there was reasonable hope that they could again--it was for this argument that Martin Luther King Jr. referred to Strange Career as “the historical bible of the civil rights movement”--that and the fact that Woodward was in Dr. King’s audience when he said it.
Whatever the “forgotten alternatives” were to segregation in the South, it was the reality of Woodward’s day...and now. Indeed, the hope that undergirded Strange Career was part of a movement that saw legally-supported and enforced segregation end and then a push to integrate that census data suggests was exhausted by the early 1980s. It would not be a gross oversimplification to say that the past 60 years has broadly traced an arc of racial progress: from a clash of racist conservatives (conservative strictly with reference to race relations, to be clear) and hopeful progressives, through an apex of idealism with the election of several black mayors in American cities, down to the current Big Sort that tragically makes de facto segregation appear not radically different.
It is in this context that the Mapping Segregation in Washington D.C. Project provides a powerful digital case study. It began in 2014, and has a well-focused frame:
For the past several years, Mapping Segregation in Washington DC has been documenting the historic role of real estate developers, citizens associations (white homeowner groups), and the courts in segregating the city. Our work has been focused on documenting properties subject to racially restrictive deed covenants, which barred the sale or rental of housing to African Americans.
Put simply, Mapping Segregation tells a powerful story. It uses traditional tools of evidence, many legal documents, photographs, and other archived materials. But it also leverages GIS and as the name suggests, powerfully illustrates the painful reality of Jim Crow’s stubborn legacy. If white Southerners have any pathological tendency, it is to convince themselves that the last major steps toward racial harmony were taken sometime in the last generation, always the last generation. The three presentations that form the heart of the project--what they call the “special exhibit”--form a potent prescription to that disease.
It is noteworthy that the scanned images of the deeds and the analytical narratives are necessary prerequisites to recognizing the full significance of the GIS work. The second presentation in particular, “Legal Challenges to Racially Restrictive Covenants” is extremely well designed, combining traditional scholarly writing with digital humanities tools, and a user-interface that is as aesthetically pleasing, for lack of a better term, as it is intuitive. Here is a sample:
My one quibble is that the landing page lacks a comprehensive survey of the project’s scope and aim--one has to explore a bit for the power and content of the material to come into focus. The project leaders, Sarah Shoenfeld and mara Cherkasky, would benefit from some reflection on what routes would get traffic to their site, and how they could compellingly grab visitors with the story from the landing page. Doing so could only help us to recover some of the forgotten alternatives to the racial acrimony and isolation we continue to suffer.
Tuesday, October 1, 2019
Holocaust Geographies
In April of 1993 the
United States Holocaust Memorial Museum opened. As it closes out its third
decade of operation, it still boasts an annual operating budget in excess of
$100 million and continues to promote rich scholarship on the most infamous
event in world history up to this point, hopefully ever. A little over a decade after the USHMM opened,
its researchers “saw potential in GIS for managing and analyzing” enormous data
sets they had compiled (p. 7). Marc Jean Masurovsky, a senior researcher at the
museum, contacted Tim Cole, Alberto Giordano, and Anne Kelly Knowles, to
organize a workshop on using GIS analysis. “The two week workshop brought
together nine scholars from historical geography, GIScience, cartography,
history, and architectural history to consider how spatial analysis and geographical
visualization of the built environment and forced movement of people during the
Holocaust might inspire new research questions and pedagogical applications” (Geographical Review, “Geographies of the
Holocaust,” p. 564).
Six essays ultimately
resulted from the two years of research launched by that August 2007 workshop,
and they are combined in Geographies of
the Holocaust. In their introduction, the three organizing scholars
summarize their distinct contribution in two clusters of dense text, dubiously
combined below:
One of the most unusual
aspects of our approach to the Holocaust is our conviction that spatial
analysis and geovisualization can complement and help specify the humanistic
understanding of space and place by exploring and quantifying relationships
among things and people to discover and visualize spatial patterns of activity.
This complementarity is at the core of our research method. […] By using these
methods, we do not claim to be describing the reality of experience; rather, we
have done something different by modeling the physical conditions of the
reality in which victims, perpetrators, and bystanders operated, in order to
ask new questions and see historical circumstances in a new light. (pp. 4-5, 8)
The key questions for me
are:
1.
Are the new
questions profound?
2.
Is the new
light truly illuminating of human experience?
Alas, I’m afraid that the
best I can offer is a hung jury on these questions. I fear that I selected the
weakest case study for my test case, “Bringing the Ghetto to the Jew:
Spatialities of Ghettoization in Budapest” by Tim Cole and Alberto Giordano.
Cole is a social historian and Giordano a professor of geography, and unfortunately
they both seem to suffer from on of the most acute cases of hyper-specialization
I have yet encountered in the academy.
Their geo-chronological
frame is Budapest, Hungary, from May to November, 1944—one city for six months.
Within their frame they show that Jews in Budapest were confined to ghettos
that consisted of the traditional “Jewish” parts of town. Yellow starts were
painted onto the sides of apartment buildings, and Jewish movement was
increasingly confined, but they still lived fairly integrated within the city’s
urban core. According to the authors, this runs contra to the grand narrative
of “concentration and segregation” presented by Raul Hilberg in The Destruction of European Jews. To
compare to a less academic source, the notion of the “dispersed ghetto” of
Budapest contrasts sharply with the depiction of the ghetto of Krakow, Poland
in Steven Spielberg’s Schindler’s List.
The essay is strongest when focusing on how this affected the experience of
Budapest’s increasingly sequestered Jewish population. “Jews were segregated
not only from non-Jews but also from other Jews, and from the means of
survival, through the erection of invisible walls of distance” (p. 148). This
is a valuable contribution.
Almost totally
unaccountably, Cole and Giordano never state the obvious reason for Budapest’s
anomalous status from the “classic” ghetto model: it was assembled ad hoc in
the last year of the war. The Germans did not even being to implement the
ghettoization process until June 16—ten days after the D-Day invasion that
added the Western front to the Eastern and Southern. To be fair, they
acknowledge that the ghettos were late in the war, but fail to identify that
context as at least a major part of the explanation for why time and resources
were not spent cordoning off several blocks of the city’s periphery to concentrate
the tens of thousands of Jewish people who had not escaped by that point in the
war. By this point Jewish victims—two thirds of which would not survive the
year—would have looked at the Allied advance with hope, while their Gentile neighbors
would have probably experienced some mixture of dread and ambivalence.
In the opening to their
chapter, the authors note that Budapest was one of 150 Hungarian cities and
towns that went through ghettoization in 1944. Four years after the publication
of Holocaust Geographies, the USHMM
published a list of 42,500
ghettos and camps. GIS has a role to play in helping to visualize these
places and the people who lived within them, but it must not disregard the
historical forest for the spatial trees, or one might say rather a different
spatial forest. Otherwise the “complementarity” will be lost.
Wednesday, September 25, 2019
Historical GIS
![]() |
| Playing with a 1953 land survey map and 2019 Google Earth. |
I love the chutzpah of Anne Kelly Knowles’s first two lines
of her opening essay, “GIS and History” of Placing History. “This book
argues that scholars’ use of geographic information systems (GIS) is changing
the practice of history. Time will tell whether the argument is prophetic or
premature.” The implication is that there is no question of the argument’s
truth, only the timeline of it—GIS will change the practice of history.
Fortunately, her bravado is well-grounded (pun intended): the book proceeds to
demonstrate several non-trivial ways that GIS had already been the primary tool
for historical enquiry for many established historians—a demographic anti-futurist
almost by definition. So what is historical GIS?
Knowles identifies four key characteristics, which may be
summarized as 1) geography significantly driving the history, 2) geography forming
a significant part of the historical evidence, 3) the use of a geo-chronological
database, and 4) the presentation of maps as part of the historian’s argument. What
Placing History needs to show to prove its central argument is that the
use of geography for enquiry, evidence, and argument does not merely deepen
existing understandings based on the traditional textual sources; it needs to
show that in some cases it has challenged existing narratives so compellingly
that GIS history has changed the overall historiography for a topic. This it
does, but first let’s examine the pedagogical values of GIS.
The late Robert Churchill identifies four “distinct benefits”:
1) the inculcation of “analytical and problem-solving strategies,” 2) the
demonstration of the value of the “visualization,” 3) engagement with “social,
economic, and political issues,” (he cites the Gulf War, indigenous land
claims, personal privacy questions, redlining, and gerrymandering) and 4) an interdisciplinary
bridge. The first two points are more compelling to me personally, but his
arguments are all valid. Amy Hillier then grabs the baton and argues more
practically for all the ways that historical GIS reaches “a technology-savvy
generation,” as well as some valuable advice concerning praxis. The use of the
1896 social class maps of W.E.B. Dubois provide a powerful example of the power
of GIS in a somewhat counter-intuitive way: Dubois recognized the power of spatial
illustration so clearly that he took months of innumerable labor hours to
produce what GIS can now reproduce in a fraction of the time.
So aside from teaching history, what is the impact of GIS on
history itself? Frankly, I’m amazed by how much it had already done at the time
of the publication of Placing History, some 15 years ago. The nature of
empire when communication traveled at the speed of a horse (not even wearing a
saddle) has been the subject of much historical enquiry going at least as far
back as Edward Gibbon. With Richard Taibert and Tom Elliot’s GIS-powered
analysis and manipulation of the medieval Peutinger map, our understanding is
deepening. This almost inverts Walter Benjamin’s powerful point about the lost
of auras in the age of mechanical reproduction—with digital reproduction, I
feel closer to the lives of the roughly 50 million citizens, slaves, and
subjects of the Roman Empire. Peter K. Bol’s work did much the same for
imperial China, as well as stretching my understanding of how insignificant a
boundary might have been, even to a provincial governor. What does all this
imply for history? I think as we become more comfortable leaning on geographic
data as evidence the idea of visualizations as distractions from historical
narrative will diminish, and the history will be stronger. No less
significantly, historians will begin to appreciate more and more just how profound
the old cliché that the map is not the territory really is—when you think about
it, the are few things more breathtakingly oversimplifying than a smooth
boundary line.
Historical scholarship will not abandon their bias for the
written record anytime soon, and I’m inclined to think that’s a good thing.
However, maps, spatial data, and GIS are all on the rise, both leading up to Placing
History and continuing since. I have long believed that the curious
historical relationship between space and time has been under-examined and under-utilized.
Speaking extremely broadly, since the rise of the recording industry, popular
music seems to entail in some cases a lag of decades from urban centers to
least populated areas. When my sister and her husband arrived in the remote
village of Kapchorwa, Uganda in 2005, Dolly Parton and Kenny Rogers were extremely
popular there—more than two decades after their peak popularity in the United
states. That is an extremely trivial example, but tracking the movement of more
catalyzing culture such as Uncle Tom’s Cabin, labor union songs, or private
schools since 1954. I think the key will be harnessing movement and sound in
the representation of data. This will necessitate historical scholarship not
being confined to the physical codex, but to a great extent we’re already
there.
Wednesday, September 18, 2019
Data Mining
Queen Victoria's journals--the work of a single person--amount to over sixty million words. By comparison, the 1611 Authorized Translation of the Bible has 785 thousand words, and the total word count of the seven books of the Harry Potter series is 1.1 million words. So imagine a close-reading purist specializing in the history of the British Empire in the Victorian era. The scholar would need to take 70 hours of work for an initial reading, not counting a single pause to jot down a note--for a single source.
All of
this underscores the practical value of machine-powered distant reading. Take
the example above, in which the Google Ngram viewer has shown the percentage of
books each year starting in 1800 that contain the phrase “social equality” or “racial
equality.” From 1920 onward, the relationship between these two distinct ideas
is unmistakable. It is also super stimulating to consider the theoretical
explanations for the peaks and troughs. Intuitively, the cataclysm of World War
II seems to have played a role in a dramatic increase in discussion of social
and racial equality. The trough between 1945 and 1953 followed by a wave that
peaked in 1972 matches near-perfectly to the grand narrative of the Short Civil
Rights Movement.
Data mining
is super valuable, but at present its immediately recognizable uses are limited
to heuristics (a way to discover lines of research) and stimulating
visualization (a way to illustrate a point). I’m a traditionalist and have been
involved in critical scholarship for more than a decade, so the following is
going to sound, well, overly critical. That said, I want to declare that I feel
extremely glad to be a student of history at this moment in history, because we
are in the process of significantly increasing our power to make sense of the
past. Still, equanimity requires some issues to be acknowledged.
As a
begrudging student of Jacques Derrida and Michel Foucault, I must point out
that the complexity of language. Perhaps the best illustration of this is the
simple fact that I am treating a topic that has been much-discussed by tens of
thousands, if not millions of people in the English-speaking world, and yet every
single sentence in this blog post is unique—not a single sentence placed within
quotation marks can be found using Google, besides this once the algorithmic spiders
have found this page. So, to take the example above, social and racial equality
may be expressed in innumerable ways—circumspect language, synonyms, ironic
expression, and slang all distort the measurement of the ideas that the two phrases
I searched for represented. Have Americans thought and written less and less
about liberty since 1800, or have they simply preferred to call it freedom?
Optical character recognition will get better, but undoubtedly whole wars have been missed because a computer thought they were wans. I’m honestly not too worried about that, because the distortions are probably statistically uniform across our various samples. My biggest concern is with the samples themselves.
Google
proudly declares that they have scanned more than 25 million books. Before the
Ngram analysis will be a source of reliable insights for me, I need to know
more about these 25 million books. How does the high proportion of scientific
journals that many humanists have raised concerns about affect the sample? Is
it geographically concentrated in particular areas besides correspondence to the
population density? All of these things are major potential issues.
Distant reading already has and will continue to empower us to understand our past--particularly the past century--but we need to keep the insights of the linguistic turn in our minds as we realize that potential.
Wednesday, September 11, 2019
Digital History Questions
Just to give my many as-yet-only-theoretical readers a little context, I am currently taking a called Digital Tools for Historians, and in this second week we are continuing to reflect on theory. Dr. French has asked that we consider five big questions in light of eight articles spanning the past twenty years of history as an academic field. Below is my admittedly in artful reflections on these questions.
What is Digital History?
Douglas Seefeldt and William G. Thomas answered this question directly and succinctly just over a decade ago (and counting--take a glance at the timestamp of this blog, historians of the future). It is "an approach to examining and representing the past hat works with the new communication technologies of the computer, the internet network, and software systems." The Big Question, to my mind, is whether digital history is best described as a new set of tools useful for an increasing percentage of historians or a fundamental transformation of what historians do. Even with laughable hyperbole like Louis Rossetto of Wired Magazine heralding "social changes so profound that their only parallel is probably the discover of fire," I actually lean toward fundamental transformation rather than powerful new tools. Yes, historians before and after the digital revolution both seek to understand the past, but it seems as though what understanding means has changed. This is a gross oversimplification, but history's focus on explanation has been replaced in digital history with a concern for visualization.
The sensationally-obtuse economic analysis of Time on the Cross forty-five years ago provides a nifty cautionary tale for historians too in the grips of the insights of Charles Beard. However, we have not yet repented of our preoccupation with quantification. A good friend drolly remarked that social scientists cannot resist taking two unquantifiable aspects of human life (joy, honor, beauty, etc...) and assigning them to the x-and-y-axes of a graph. Digital historians do this less than psychologists, but we still do it. More often we report on the frequency of sets of words and then sheepishly acknowledge we can only infer possible explanations of the significance of word-use.
How does 21st century Digital History theory/practice differ from earlier applications of computer technology to historical research, such as the data-driven quantitative history (“cliometrics”) of the 1970s?
As we close out the second decade of the twenty-first century, it still seems as though digital history is in something of a pubescent stage. In brief, we are still swimming around in data, unsure of the best tools to make sense of it. My tone has been a bit too sour up to this point, so I'm going to pivot a bit and say that the past twenty years has clearly shown growth and a lot of good history has been done digitally. It's just that we're still waiting for that hockey stick exponential rise on the line graph of powerful digital contributions, and I might be getting somewhat impatient.
To answer the header question directly, current digital history is far more focused on experience and scholarly networking than turning everything into numbers, though as indicated above I think we still have further to go before we are no longer guilty of overemphasizing measurement and sample size. I think we are also more wisely trying to make as much source-material available to each other for conversation and collaboration rather than just use the new tools for ourselves and refer readers to an appendix for our data and methodology.
How does Digital History differ from Digital Humanities?
My analysis here is probably going to be pretty facile, but digital history is a subset of digital humanities that is more concerned with understanding the human past rather than human nature most broadly. For that reason, we tend to be more staid and conservative in our approach to the use of digital tools than other digital humanists, and have used tools that fill gaps within our traditional paradigm, e.g., Global Information Systems (generally, "where on earth did this happen" is much more important to a historian than a sociologist).
What are the promises/perils of doing Digital History?
The promise of Digital History is the ability to reach a wider audience than ever before, having studied a great number of more diverse sources than ever before, having produced for that audience a product that appeals more directly to their senses. Hyperlinking allows connections to be made with so much less work, and storage is so much easier and information more easily-retrieved than ever before, and with every sign that it will get progressively more easy.
To my mind, the most significant peril is what has happened to performance art, journalism and photography--the barrier to entry is now so low that the supply is overwhelming demand. Just like Broadway, where you can't make a living, only a killing, the structure of the economy within the academy is experiencing a severe strain, and readjustment seems about as vital as it is inevitable. Historians also fret over how much digital documentation is being lost to the maw of archaic operating systems and a lack of back-compatibility, but I'm uncompelled. We already get more data everyday than anyone could review in a hundred lifetimes, so I think future historians will be okay.
Can we make Digital History, as a field, more inclusive?
Professors Sharon Leon and Sheila Brennan argue pretty implacably that yes, we can do a great deal better, Leon diagnosing and Brennan pointing to tools for a cure. While I found Leon's work a tad uncharitable to her field, given that the second wave of feminism just crashed less than two generations ago, it was nevertheless disturbing to read of so many important female contributions that have been and continue to be marginalized. My big gripe is that Brennan and Leon outline a lot of the smoke and just a bit of the fire of injustice women and other groups have suffered from the white male academy, the wood is out of frame. Why are misogyny, racism, and cultural elitism such barnacles on the human heart and mind? If I have learned anything in the past decade, it is that (self) awareness can never be taken for granted.
Thursday, September 5, 2019
Open Access and Scholars' Costs of Living
[Since I'm reflecting on copyright law, I will not include any of the images from the surveillance footage of Aaron Swartz entering the MIT wiring closet--you can Google it and imagine it here]
In September of 2010, a software technician working for the
digital library JSTOR noticed something highly unusual: more than 200,000 separate
“sessions” downloading academic articles from their server, all coming from the
MIT library. Three months later, the ‘hacktivist’ Aaron Swartz was arrested for
breaking and entering. Swartz had used a connection within a wiring closet to
download the articles to his laptop. These charges were later dropped when authorities
discovered that 1) Swartz was a Harvard research fellow entitled to access the
server and 2) though the closet was supposed to have controlled access, it had
been unlocked. However, he was charged with the following federal crimes: wire
fraud, computer fraud, unlawfully obtaining information from a protected
computer, recklessly damaging a protected computer, aiding and abetting, and
criminal forfeiture—crimes with a maximum possible sentence of fifty years. He
refused a plea bargain that would have entailed six months in a federal prison,
and on January 11, 2013, the 26-year-old hanged himself before his trial could
begin.
The story of Aaron Swartz is a tragedy, one that illustrates
in the starkest terms the tension explored in chapter nine of Digital
Humanities: A Primer: the altruistic impulse to share knowledge as freely
as possible, and the need for the scholars working to acquire that knowledge to
make a living. Part of Swartz’s legacy is greater attention to the gravity of
this problem. There is much in this knot: the commodification of knowledge, the
value of the role of gatekeepers and those who maintain digital infrastructure,
the question of scholarly independence from the source of their funding, and
what just compensation for work even means, just to name a few.
As an educator who has been immersed in economics for more
than a year now as part of my job, I was bemused by this line: “[M]ost scholars
see their work’s value in hiring, tenure and promotion (HTP) terms, not in
terms of the commercial marketplace, and they are quite willing to distribute
their work as freely as possible” (157). In brief, digital humanists must
consider the big picture—the context to use a favorite word of historians—if they
hope to continue to buy groceries and pay their bills through their
scholarship. The following passage succinctly summarizes the current moment exactly
as a primer should:
The late twentieth century therefore inherited two vibrant models of access to information and knowledge: one the patronage model sustained by individual, institutional or governmental resources and the other the commercial model built on the ability to produce vast amounts of inexpensive print for the broadest market possible. (159)
Space requires a bit of a leap, so I will just say that for
me it boils down to the following: scholars need to consider who we want to
write for—the general public or a specialized audience? In almost any topic, there
are elementary concepts that will be necessary for the general public but
tedious for specialists (they already know all this), and highly
technical questions that the specialists will find stimulating and the general
public somewhere between boring and intolerable. From that observation, it
strikes me that the most efficient division of labor requires honest
soul-searching of each scholar. If I am not just fluent but enjoy the most esoteric
and complex aspects of my topic, I should explore those areas in writing
intended for fellow specialists. If I respect these issues but find the
language difficult and/or my interest wanes as the analysis gets more abstract,
I should write for a popular audience. Both kinds of scholars contribute
something valuable, and both kinds of scholars should write with a mind to
bridge the gap between the two groups. ‘Minding the gap’ will make the expert’s
writing clearer and more accessible, and the popular writer’s writing more
accurate.
None of the above solves the problem of open access and
vocational compensation, but it is part of the solution. Unless a new model is
forged, experts will need to be content with their writing being hidden behind pay
walls of various kinds, and institutions like JSTOR taking a beefy cut, and
popular writers will need to be content not being featured in the most
prestigious journals.
Sources:
Dean, John, and John W. Dean. “Dealing With Aaron Swartz in
the Nixonian Tradition: Overzealous Overcharging Leads to a Tragic Result.” Verdict
Comments, 14 Mar. 2018,
verdict.justia.com/2013/01/25/dealing-with-aaron-swartz-in-the-nixonian-tradition.
MacFarquhar, Larissa. “The Darker Side of Aaron Swartz.” The
New Yorker, The New Yorker, 10 July 2019,
www.newyorker.com/magazine/2013/03/11/requiem-for-a-dream.
Subscribe to:
Posts (Atom)









