14.0262 methodological primitives

From: by way of Willard McCarty (willard@lists.village.Virginia.EDU)
Date: 09/22/00

  • Next message: by way of Willard McCarty: "14.0264 Convergence, Winter 2001: adoption and diffusion of new media technologies"

                   Humanist Discussion Group, Vol. 14, No. 262.
           Centre for Computing in the Humanities, King's College London
                   <http://www.princeton.edu/~mccarty/humanist/>
                  <http://www.kcl.ac.uk/humanities/cch/humanist/>
    
       [1]   From:    Wilhelm Ott <zrlot01@zdv.uni-tuebingen.de>          (71)
             Subject: Re: 14.0258 methodological primitives?
    
       [2]   From:    lachance@chass.utoronto.ca (Francois Lachance)      (68)
             Subject: Re: 14.0258 methodological primitives?
    
       [3]   From:    "Osher Doctorow" <osher@ix.netcom.com>              (31)
             Subject: Re: 14.0258 methodological primitives?
    
    
    --[1]------------------------------------------------------------------
             Date: Fri, 22 Sep 2000 09:41:49 +0100
             From: Wilhelm Ott <zrlot01@zdv.uni-tuebingen.de>
             Subject: Re: 14.0258 methodological primitives?
    
    Dear Willard (and Humanist participants),
    
    of course, the idea of "methodological primitives" is useful.
    You may be aware that, with TUSTEP, we have tried (for more than
    30 years now) to take this idea as a basis for a software which
    is not only flexible but also guarantees that the user can really
    take over the responsibility for the results - since he is able
    (and has to do it) to define in every detail the "basic" operations
    to be carried out by the computer. For a short description of the
    concept, you are invited to consult
        http://www.uni-tuebingen.de/zdv/tustep/tustep_ox92.html
    (a paper given 1992 at the ALLC-ACH conference in Oxford).
    
    
    I disagree however regarding two of the three examples for the
    low-level end of "methodological primitives":
    
    
    1. "lemmatizing the word forms of an inflected language":
    
    Let me argue from my own experience:
    
    It took us (Father Bonifatius Fischer and his team and myself)
    more than two years to lemmatize the latin inflected word forms
    occuring in the text of the Vulgate, though we had at our disposal
    the "Lexicon Electronicum Latinum" which Father Busa had compiled
    for his Index Thomisticus and though had written a software which
    automatically added, with the help of this Lexicon, the lemma to
    each inflected form - a procedure which produced an average of
    2,5 proposals for each inflected form! We sorted the material back
    into the text sequence, in order to have the context available for
    the "manual" decision which of the proposed lemmatizations was the
    correct one.
    
    Expecting that a piece of software at the level of "methodological
    primitives" could decide if "facies" is, at a given location, a
    verb form or a substantive is far from reality in near future - though
    30 years have gone since we lemmatized the Vulgate; also today,
    even "higher level" software like SYSTRAN (which is e.g. offered for
    translating web pages found by Altavista) fails to take the context
    into account (and therefore translates the term "content provider"
    into the German "der zufriedene Anbieter").
    
    
    2. "Alphanumeric sorting":
    
    Yes, if you have in mind a routine which rearranges a file according
    to the sequence of bits at certain locations of every record,
    irrespective if they are to be interpreted as numbers or as digits
    and letters, I agree. But this is far from what I would call
    "alphanumeric ordering". In TUSTEP, we did not even try to fully
    automize the steps necessary to arrive at a required ordering.
    Not only are there different rules for almost every language
    (the EC has only recently established a norm for multilingual
    ordering in EU documents, see http://www.stri.is/TC304/EOR), but
    sometimes (as for German) even concurrent rules, depending on the
    type of index you generate (e.g lexicon on the one hand, telephone
    directory  or library catalogue at the other). And for scholarly
    purposes, also historical (or even unregulated) orthographies must
    be covered. So, what is needed is more elementary than "alphanumeric
    ordering": it is a routine which allows to construct arbitrarily
    complex and multiple "ordering keys" from a given text or part of
    text, which then allows to obtain the required order with the help
    of a "mechanical" ordering procedure which has to do nothing than to
    rearrange the records according to the ascending value of the bit
    string contained in the ordering key. The scholar must however have
    the possibility to define in every detail the rules for constructing
    the ordering keys.
    
    Therefore, also "alphanumeric sorting" is, for these purposes,
    too complex a procedure to be put on the lower level of
    methodological primitives.
    
    A short summary of what we regard as "basic operations for text data
    processing" which should be at the disposal of a textual scholar can be
    found at http://www.uni-tuebingen.de/zdv/tustep/tdv_eng.html#b
    
    Yours,
    Wilhelm Ott
    ---------------------------------------------------------------------
    Prof. Dr. Wilhelm Ott                 phone: +49-7071-2970210
    Universitaet Tuebingen                fax:   +49-7071-295912
    Zentrum fuer Datenverarbeitung        e-mail: ott@zdv.uni-tuebingen.de
    Waechterstrasse 76                    http://www.uni-tuebingen.de/zdv/
    D-72074 Tuebingen
    
    --[2]------------------------------------------------------------------
             Date: Fri, 22 Sep 2000 09:43:10 +0100
             From: lachance@chass.utoronto.ca (Francois Lachance)
             Subject: Re: 14.0258 methodological primitives?
    
    Willard,
    
    
    I just had to pour myself a glass of stout -- one of the most primitive of
    scholarly processes -- to begin to reread and unravel that very intriguing
    quasi-grant speak paragraph...
    
    
       > I'll call these mechanical operations "methodological primitives" and
    
    why not call them "mechanical operations"?
    what is gained/lost in calling them "methodological primitives"?
    
    As I sip on my stout, I wonder if this not close to reifying a process.
    Not a bad thing in itself when looks at a systems analysis of a complex
    set of exchanges and communications across multiple nodes and
    processing centres. Flowcharts of where something is produced, housed,
    transfered, transformed and disposed, work for me. Stout is brewed, bottle
    and shippped. Yet I wouldn't call any of these either "mechancial
    operations" or "methodological primitives". Now the products and tools of
    your Humanities Computing may be an entirely different type of entity than
    a good bottle of stout.
    
      > define the type as "an algorithmically specifiable transformation of
    data
      > that forms a recognisable component of multiple scholarly processes".
    
    are not all transformation algorithmically specifiable?
    (includinging random event generators such a pausing to sip on one's
    breverage)?
    
    recognisable to whom? to all the participants across all the disciplines
    or to the "translators" at cross-disciplinary thresholds? Note, I have
    indeed moved from your "scholarly processes" to "disciplines" thus
    mimicking your initiall move from mechanical to methodological and from
    process to primitive ... (I'm sure some ale drinkers could contribute some
    fine words on the comparative etymology of "method" and "mecanic" and
    there is scarely an imbiber of single malts who would not want to venture
    and opinion as to the word roots that tangle "primitive" and "process" in
    a heady concotion given several rounds of fine peaty Islay scotch.)
    
       As an
      > idea I would suppose the type to be a useful analytic tool with which to
      > resolve what humanists do into a loosely bounded set of interoperable
      > software components which could be assembled in whatever order by a scholar
      > in order to aid his or her research.
    
    I want to do a very primitive reader process and reparse this last bit so
    I just might switch to grappa.
    
    I would heartily raise a glass of fire-water to the aid to research any
    thought about the use value of ideas (as long as it came from cows raised
    on organic feed and free to range).
    
    Actually I like the paragraph, I just would from a general systems
    perspective change one word. The "into" should be changed to "with".
    
    And I would add, draining my glass, that one of those components include
    communications software to place the "with" as the hinge between a scholar
    and the data but also between scholars. As I place my glass on the table
    before me, I insist, comtemplating the empty glass, that behind much of
    the lingo in that artfully crafted paragraph is a primitive all right, one
    called the "means of production".
    
    [this is where they haul me off before I rant and sputter and start
    quoting long passages from a book with a long history and attempt
    to reconcile that book's section on "the two fundatmental forms of
    manufacture -- heterogenous and organic" with the elegant and freighted
    word "interoperable"]
    
    
      > My question is, how strong is the idea of methodological primitives? How
      > useful? Is this a direction in which we should go?
    
    If we were to tour Scotland, I could do the islands and you could do the
    highlands, and we could meet in Thailand to compare notes. The year after
    I could follow your route and yours mine and we could meet in Helsinki to
    swap stories. OR we could swap stories while on the road. If we all went
    in the same direction ... there would be no strength to any idea.
    
    Promise to send me a postcard from where ever you plant the hops, harvest
    the grapes or milk the cows.
    
    f.
    
    --
    Francois Lachance, Scholar-at-large
    	http://www.chass.utoronto.ca/~lachance
    Member of the Evelyn Letters Project
    	http://www.chass.utoronto.ca/~dchamber/evelyn/evtoc.htm
    
    --[3]------------------------------------------------------------------
             Date: Fri, 22 Sep 2000 09:44:05 +0100
             From: "Osher Doctorow" <osher@ix.netcom.com>
             Subject: Re: 14.0258 methodological primitives?
    
    From: Osher Doctorow, osher@ix.netcom.com, Thurs. Sept. 21, 2000, 9:24PM
    
    Professor McCarty, as usual your questions are ingenious.  I do not know the
    answer.  I have been wondering something similar, although somewhat across
    disciplines.  Readers might be interested in one or two of my last few
    contributions to evolutionary-computing@mailbase.ac.uk .  Whatever the
    answer, and I certainly will look into it, we need to remember (I think) to
    keep our goals high.  In fact, I am beginning to think that people's goals
    should always be higher than they can even imagine - or maybe as high,
    depending on the way one looks at it.  I know that produces some strange
    side effects, as one might call them, including the Man From La Mancha, but
    I am beginning to change my view even of the Man From La Mancha.  I think
    that Cervantes could have had an intuition about the universe, whose
    mysteries, whose questions and answers, go far beyond what is commonly
    believed and held to be true, from prehistory and pyramids in Ancient Egypt
    to the Magna Carta and Limelight and modern times on earth, not to mention
    elsewhere.  Occasionally some angry blamers get hold of goals beyond their
    reasoning, like the Nazis who made the Holocaust and the bombing of Britain.
    But, as I indicated on evolutionary-computing, there have to be questions
    about viruses' viruses (viruses which destroy other viruses), things between
    life and viruses, things between death and viruses, things between the
    inorganic and viruses, and so on, before we can probably conquer cancer and
    AIDS and even polio and tuberculosis whose symptoms we can eliminate but
    whose causes we know nothing about.   Let us have a type and a
    methodological primitive or many for humanist computing and
    interdisciplinary computing, by all means, but let it be types for all
    seasons and all imaginings or beyond.  Let us try not to merely computerize
    machines to recreate human beings but to begin with what we know about
    humanist computing and viruses and life and science and humanities and move
    onward from there.  A Type For All Seasons.  A Type for the Viruses' Virus,
    the cancer's cancer.  Maybe even a type for the Old Man/Woman/Being, as
    Einstein thought.
    
    Osher Doctorow
    



    This archive was generated by hypermail 2b30 : 09/22/00 EDT