Just to clarify my position, I don't actually think that the En. lexeme
“w*rd” is easy to define, precise or theoretically well-founded (I
prefer “lexeme” here, as Ada's previous use of “term” is improper from a
wusterian point of view, given that “w*rd” lacks distinctive traits due
to its notorious ambiguity).
The situation is similar in mathematics where “number” is used to denote
a variety of concepts such as natural numbers, integers, fractions, real
numbers, irrational numbers, imaginary numbers … which may be inclusive
or exclusive of each other. There are thus numerous contexts in which
colloquial use of the w*rd “number” would be imprecise, inappropriate
and might even lead to confusion. Nonetheless, I'm not aware of any
mathematicians who advocate censorship of the w*rd “number”.
If “w*rd” lacks a clear definition and a clear theoretical foundation
(which I actually agree with), then it can't really be used as a “term”
until the concept has been given an adequate definition in relation to
other terms within the relevant domain or theoretical framework.
On the other hand, though precise terminology is always preferable
whenever and wherever precision is necessary, there's nothing ever to be
gained scientifically through censorship (sorry to use an ungood w*rd,
but, in all earnestness, when I see a spade I call it a “spade”).
DH
On 20/06/2022 22:13, Daniel HENKEL wrote:
Not to mention all these shamefully unscientific posts on Corporalist:
/12th International Global W*rdnet Conference Donostia / San
Sebastian, Basque Country 23-27, 2023 Global W*rdnet Association:
www.globalw*rdnet.org//
//Conference website: https://hitz.eus/gwc2023/
/18th Workshop on Multiw*rd Expressions (MWE 2022) Organized and
sponsored by SIGLEX, the Special Interest Group on the Lexicon of the ACL/
/The 5th Workshop on Multi-w*rd Units in Machine Translation and
Translation Technology (MUMTTT 2022) Malaga, 30th September 2022/
...
Definitely time for some lexical/terminological restrictions/updates,
for the sake of goodthink/processing, and science!
(actually "science" is heretical/redundant, "goodthink/processing"
will do the job:
/"As we have already seen in the case of the word FREE, w*rds which
had once borne a heretical meaning were sometimes retained for the
sake of convenience, but only with the undesirable meanings purged out
of them. Countless other w*rds such as HONOUR, JUSTICE, MORALITY,
INTERNATIONALISM, DEMOCRACY, SCIENCE, and RELIGION had simply ceased
to exist."/)
DH
On 20/06/2022 21:47, Daniel HENKEL wrote:
Looks as if Linguistlist is in need of some scientific enlightenment
as well :
http://linguistlist.org/issues/33/33-2063.html
/In the new, thoroughly revised second edition of W*rds of Wonder:
Endangered Languages and What They Tell Us, Second Edition (formerly
called Dying W*rds: Endangered Languages and What They Have to Tell
Us), renowned scholar Nicholas Evans delivers an accessible and
incisive text covering the impact of mass language endangerment. The
distinguished author explores issues surrounding the preservation of
indigenous languages, .../
(ungood w*rds unw*rded to protect the faint of mind against ungood
thinking/processing).
Best,
DH
On 20/06/2022 20:27, Ada Wan wrote:
(I just expounded on a point as a twitter reply today re the
granularity of one's thinking/processing. Pls feel free to read that
also.)
One can think of it in a less binary manner --- not "good" vs "bad",
not "words" then "sentences", but to think of an utterance/sequence
with all the finer connections in between... That is the beauty of
language --- from a "philological" point of view.
I am not sure, though, if you were speaking from a scientific
perspective, because I have a paper to back my argument in that regard.
On Mon, Jun 20, 2022 at 6:06 PM Sylvain Kahane <sylv...@kahane.fr>
wrote:
“We’re destroying words–scores of them, hundreds of them, every
day. We’re cutting the language down to the bone.” […]
“It’s a beautiful thing, the destruction of words. Of course the
great advantage is in the verbs and adjectives, but there are
hundreds of nouns that can be got rid of as well. It isn’t only
the synonyms; there are also the antonyms. After all, what
justification is there for a word which is simply the opposite
of some other words? A word contains its opposite in itself.
Take ‘good,’ for instance. If you have a word like ‘good,’ what
need is there for a word like ‘bad’? ‘Ungood’ will do just as
well–better, because it’s an exact opposite, which the other is
not. Or again, if you want a stronger version of ‘good,’ what
sense is there in having a whole string of vague useless words
like ‘excellent’ and ‘splendid’ and all the rest of them?
‘Plusgood’ covers the meaning, or ‘doubleplusgood’ if you want
something stronger still. Of course we use those forms already,
but in the final version of Newspeak there’ll be nothing else.
In the end the whole notion of goodness and badness will be
covered by only six words–in reality, only one word. Don’t you
see the beauty of that, Ada?…”
George Orwell, 1984
> Le 20 juin 2022 à 17:33, Ada Wan <adawan...@gmail.com> a écrit :
>
> Hi Christopher,
>
> It is of the best interest of the community to discontinue the
usage of "word". The term is not only very shaky in its
foundation (if any), but it can also effect disparity in
performance in computational processing and robustness when
human evaluation is involved.
> Despite the term has been casually adopted by many in the
past, like many un-PC terms that may have an inappropriate
undertone, it needs to be discouraged and abandoned.
> Last but not least, I noticed that you are located in Canada,
in the event that you were to work with any indigenous
communities, one MUST be advised to be careful with the usage of
such term --- you could be imposing your own (EN- / FR- /
dominant language-centric) view onto another
individual/community. There is an element of cultural and
linguistic hegemony with the usage of such term (including and
not limited to making applications with it).
> Please also consult recent work in this area:
https://openreview.net/forum?id=-llS6TiOew.
>
> Feel free to get in touch if you should have any questions.
>
> Best,
> Ada
>
>
> On Mon, Jun 20, 2022 at 4:53 PM Christopher Collins
<christopher.coll...@ontariotechu.ca> wrote:
> Hello,
>
>
>
> I’m looking for any open source or cloud-hosted solution for
complex word identification or word difficulty rating in French
for a reading application.
>
>
>
> As a backup plan we can use measures like corpus frequency,
length, number of senses, but we’re hoping someone has already
made a tool available.
>
>
>
> We found this but that’s it: https://github.com/sheffieldnlp/cwi
>
>
>
> Would appreciate any tips!
>
>
>
> Thanks,
>
>
>
> Chris
>
>
>
> Christopher Collins [he/him]
> Associate Professor - Faculty of Science
> Canada Research Chair in Linguistic Information Visualization
> Ontario Tech University
> vialab.ca <http://vialab.ca>
>
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list -- corpora@list.elra.info
> To unsubscribe send an email to corpora-le...@list.elra.info
> _______________________________________________
> UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
> Corpora mailing list -- corpora@list.elra.info
> To unsubscribe send an email to corpora-le...@list.elra.info
_______________________________________________
UNSUBSCRIBE from this page:http://mailman.uib.no/options/corpora
Corpora mailing list --corpora@list.elra.info
To unsubscribe send an email tocorpora-le...@list.elra.info
--
Daniel HENKEL <https://univ-paris8.academia.edu/DanielHENKEL>
/Maître de Conférences (Linguistique et Traduction)
UFR5 LLCE-LEA • EA1569 TransCrit/
Université Paris 8 Vincennes-St-Denis
/“non si può stendere una tipologia delle traduzioni, ma al massimo
una tipologia di diversi modi di tradurre, volta per volta negoziando
il fine che ci si propone
– e volta per volta scoprendo che i modi di tradurre sono più di
quelli che sospettiamo.”/ U. Eco
_______________________________________________
UNSUBSCRIBE from this page:http://mailman.uib.no/options/corpora
Corpora mailing list --corpora@list.elra.info
To unsubscribe send an email tocorpora-le...@list.elra.info
--
Daniel HENKEL <https://univ-paris8.academia.edu/DanielHENKEL>
/Maître de Conférences (Linguistique et Traduction)
UFR5 LLCE-LEA • EA1569 TransCrit/
Université Paris 8 Vincennes-St-Denis
/“non si può stendere una tipologia delle traduzioni, ma al massimo
una tipologia di diversi modi di tradurre, volta per volta negoziando
il fine che ci si propone
– e volta per volta scoprendo che i modi di tradurre sono più di
quelli che sospettiamo.”/ U. Eco
_______________________________________________
UNSUBSCRIBE from this page:http://mailman.uib.no/options/corpora
Corpora mailing list --corpora@list.elra.info
To unsubscribe send an email tocorpora-le...@list.elra.info
--
Daniel HENKEL <https://univ-paris8.academia.edu/DanielHENKEL>
/Maître de Conférences (Linguistique et Traduction)
UFR5 LLCE-LEA • EA1569 TransCrit/
Université Paris 8 Vincennes-St-Denis
/“non si può stendere una tipologia delle traduzioni, ma al massimo una
tipologia di diversi modi di tradurre, volta per volta negoziando il
fine che ci si propone
– e volta per volta scoprendo che i modi di tradurre sono più di quelli
che sospettiamo.”/ U. Eco_______________________________________________
UNSUBSCRIBE from this page: http://mailman.uib.no/options/corpora
Corpora mailing list -- corpora@list.elra.info
To unsubscribe send an email to corpora-le...@list.elra.info