-
Notifications
You must be signed in to change notification settings - Fork 95
<foreign> clarifications and more alternatives #2814
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: dev
Are you sure you want to change the base?
Conversation
…reign>. Added an entry to the example bibliography. Refs #2786
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These strike me as sensible edits, and the example is brilliant! (Don't forget to add the source to BIB!)
<egXML xmlns="http://www.tei-c.org/ns/Examples" xml:id="COHQF-egXML-aa" > | ||
where there was "<del>a</del> <seg xml:lang="sco">Deel a Sole</seg> upon the Shoe" | ||
</egXML> | ||
<egXML xmlns="http://www.tei-c.org/ns/Examples" xml:id="COHQF-egXML-kbl" source="#COHQHF-eg-kbl"> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perfect example!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
These changes are a definite improvement. The only quibbles I have are tiny and have nothing to do with the intent or motivation for the ticket.
I liked the new example with Latin (which is technically foreign to everyone now), up to "Bee Master", but the following sentence threw me because I have no idea what the text is, so the sudden mention of wax-moths let me momentarily puzzled. I don't think that final sentence is needed (although it might be enjoyable if you happen to know the text), and if I put myself in the position of a non-native, non-European speaker I can imagine reaching for a dictionary to find out what a wax-moth is and wondering how it's relevant to the "foreign" thing. So I would lose that sentence.
In the Scots example, though, I feel there's somehow not enough context for the words to mean anything much; perhaps it would help to gloss the phrase, again to help non-native speakers: " = 'a [great] deal of sole', or a substantial sole" (assuming that's what it actually means).
I wonder about mentioning <emph>
here:
In these cases it is also preferable to use <gi>seg</gi> with the appropriate <att>xml:lang</att>, or <gi>emph</gi> if the text is emphasized.
If a term in a different language from the surrounding text is distinguished in some way such as italicization, I think it's most likely going to be because it's in another language rather than because it's being emphasized, and in any case, how would you distinguish the two? Perhaps it's unwise to suggest that simply being in another language might constitute a kind of emphasis. But I see your example below uses it. Is "fils" really emphasized, or just in another language and therefore italicized?
None of these are a big deal for me, though. If you want to go ahead and merge, I'll be happy.
The wax moth example is not new! I chose it many years ago. |
Thanks @martindholmes . I think @joeytakeda has asked his team for other Scots examples, so we may end up with a clearer one. In the "fils" example, the emphasis is editorial; that is, the source document is a manuscript, but the editors want the word italicized because it is in another language than its surrounding text. At same time they don't want to call it foreign because it is't in the historical context of the document. Perhaps a |
Surely that would imply that the word was italicised in the original ms? If this is all about how to render the markup in the edited version thats a job for the render ing software not the markup methinks. |
Right, yes – this is probably not a good enough example to justify the use of |
Also, while I'm here, I think an example using |
@lb42 Sorry, I must have missed the wax moth example before, or never focused on it. But my comment still stands: I think it would present a barrier to a non-native speaker (or even a native speaker who finds it difficult to skip over a puzzle). |
@raffazizzi If "fils" is not italicized in the original, then presumably there's no signal to say that it is emphasized in the original at all, so |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you @raffazizzi for your time and effort in adding this extra clarification and guidance. I welcome these edits, I agree that this is a definitive improvement to the Guidelines chapter. I like the example provided for <emph>
but I understand the concerns already raised here. Hopefully, the example that @joeytakeda has volunteered to search for would provide a better use case for .
I only have one small suggestion for line 587 in the CO chapter so that it is consistent with the phrasing of the rest of the edits.
</p> | ||
|
||
<p>The <gi>foreign</gi> element should not be used to represent foreign words | ||
<p>The <gi>foreign</gi> element should additionally not be used to represent foreign words |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Perhaps "should additionally not be used to represent words in a different language which are mentioned or glossed within the text..." to be consistent with the changes above and below?
The Keywords of Black Louisiana team also provided English translations of the documents (in French+Spanish) that they edited. They used Is this too unique a stance for the Guidelines? I think it's legitimate. Note that in this case the Here is an example that could be added to the chapter instead of the one I presented earlier. <p>
<lb/>When asked about the statement of
the <emph xml:lang="fr">sauvagesses</emph> he
<lb/>replied no, he answers the question.
</p> Which is the editor's translation of: <p>
<lb/>Intérogé Surla Cite du sauvagesses il
<lb/>répond que non, Il répond l'interoge…
</p> |
Wouldnt <distinct> be more appropriate than <emph> here?
…________________________________
From: Raff Viglianti ***@***.***>
Sent: Monday, September 22, 2025 5:26 PM
To: TEIC/TEI ***@***.***>
Cc: Lou Burnard ***@***.***>; Mention ***@***.***>
Subject: Re: [TEIC/TEI] <foreign> clarifications and more alternatives (PR #2814)
[https://avatars.githubusercontent.com/u/144770?s=20&v=4]raffazizzi left a comment (TEIC/TEI#2814)<#2814 (comment)>
The Keywords of Black Louisiana team also provided English translations of the documents (in French+Spanish) that they edited. They used ***@***.***:lang] instead of foreign whenever they retained in the translation original terminology in French or Spanish from the documents. I would say that in these cases the words are indeed foreign to English, but the editors didn't want to impose "foreignness" even on the translation of this multi-lingual and culturally sensitive material.
Is this too unique a stance for the Guidelines? I think it's legitimate. Note that in this case the emph is the TEI's authors emphasis rather than the being derived from a source document.
Here is an example that could be added to the chapter instead of the one I presented earlier.
<p>
<lb/>When asked about the statement of
the <emph xml:lang="fr">sauvagesses</emph> he
<lb/>replied no, he answers the question.
</p>
Which is the editor's translation of:
<p>
<lb/>Intérogé Surla Cite du sauvagesses il
<lb/>répond que non, Il répond l'interoge…
</p>
—
Reply to this email directly, view it on GitHub<#2814 (comment)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/AAFBJ5H7UIZTJHDT4DMU4C33UAPK7AVCNFSM6AAAAACGVQAAR6VHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZTGMRQGAZTANZTGA>.
You are receiving this because you were mentioned.Message ID: ***@***.***>
|
This PR attempts to clarify and offer alternatives to the use of
<foreign>
in multilingual texts, or texts originating from multilingual contexts. See #2786.I am initially setting this as draft to prompt feedback and suggestions on the proposed changes.