[tei-council] Note on 2372570, "element for punctuation marks"
David Sewell
dsewell at virginia.edu
Sun Mar 29 16:11:28 EDT 2009
This might indeed be a case where the best solution would be for someone
like Alexei who needs the element to add it via ODD customization.
I'm open to being convinced in that direction & I think some in-person
discussion will be helpful.
On Sun, 29 Mar 2009, Lou Burnard wrote:
> I just want to point out that the "level of abstraction" purportedly
> introduced by the entity reference markup proposed in P4 was a snare and a
> delusion, like any other semantic markup based on the use character entity
> names, which is why we got rid of it. If you want to introduce semantic
> distinctions amongst your punctuation marks, you must do it with proper XML
> markup constructs like distinct elements, attribute values or whatever.
>
> Without wishing to prejudge the issue, I remain to be convinced that we need
> something different from <g> or <c> to do this. It's hard enough explaining
> the difference between <g> and <c>. I'm not looking forward to explaining why
> the TEI has three different ways of tagging punctuation marks.
>
>
>
> David Sewell wrote:
>> As an addendum to this note, if you compare the P3 section on "Treatment of
>> Punctuation"
>>
>> http://www.tei-c.org.uk/Vault/GL/P3/CO.htm#COPU
>>
>> with the P5 version
>>
>> http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CO.html#COPU
>>
>> it seems that the TEI-specific entity names in P3 offered a level of
>> abstraction for punctuation marks that we have lost in P5 and that the
>> proposed <punct> element would restore.
>>
>> (For what it's worth, the word "punctuation" has occurred in only 3
>> different threads since 1997 on TEI-L.)
>>
>> David
>>
>> On Sun, 29 Mar 2009, David Sewell wrote:
>>
>>> ["Adopt-a-RED" note]
>>>
>>> Submitter: Alexei Lavrentev
>>>
>>> Reference:
>>> http://sourceforge.net/tracker/?func=detail&aid=2372570&group_id=106328&atid=644065
>>>
>>> DISCUSSION:
>>>
>>> Alexei has presented an extended argument for the addition of a <punct>
>>> element in his revised TEI MM 2008 paper, available here:
>>>
>>> http://sourceforge.net/tracker/download.php?group_id=106328&atid=644065&file_id=303629&aid=2372570
>>>
>>> I think we're going to need to talk about this face to face; I don't
>>> feel competent to make a yes/no recommendation without further
>>> discussion. I would urge everyone to read the paper ahead of our meeting
>>> (it's only 5 pages), out of courtesy to Alexei if nothing else, as he is
>>> one of the local organizers.
>>>
>>> My own opinion is that he makes a strong case for a dedicated
>>> punctuation element. I think he's right that within the context of
>>> Linguistic Segment Categories (17.1), <c> makes much more sense as
>>> markup for characters that can be part of words or morphemes. I'm not in
>>> a position to evaluate his arguments based on automated language
>>> processing or medieval manuscript practice, but I can immediately see an
>>> application in an area I'm more familiar with, encoding of manuscript
>>> verse by Emily Dickinson. Her use of the dash is notorious for its
>>> polysemy, and attempts have been made to characterize different types of
>>> dash. From that point of view there is no single dash "character",
>>> rather a set of idiosyncratic punctuation markers that may or may not
>>> take identifiable distinct forms. Having a <punct> element available
>>> would simplify interpretive markup of her verse.
>>>
>>> David
>>>
>>>
>>
>
>
--
David Sewell, Editorial and Technical Manager
ROTUNDA, The University of Virginia Press
PO Box 801079, Charlottesville, VA 22904-4318 USA
Courier: 310 Old Ivy Way, Suite 302, Charlottesville VA 22903
Email: dsewell at virginia.edu Tel: +1 434 924 9973
Web: http://rotunda.upress.virginia.edu/
More information about the tei-council
mailing list