[Corpora-List] Re: Announcing CreoleVal

2023-10-31 Thread Ada Wan via Corpora
Dear Johannes Re "CreoleVal": at this point, it's more like a "one shouldn't" as opposed to whether "one can". The following is what I wrote to the SIGTYP, I think the message would be similar for your initiative: """ ---------- Forwarded message ---

[Corpora-List] Re: Lemmas and Lemmatization [was Re: NIF: NLP Interchange Format]

2023-10-26 Thread Ada Wan via Corpora
Paterson III wrote: > @Ada > > >What do we do with students/graduates who were fed archaic ideals? > You give them full professorships. ;-) > > - Hugh > > On Wed, Oct 25, 2023 at 7:29 AM Ada Wan via Corpora < > corpora@list.elra.info> wrote: > >> D

[Corpora-List] Re: [External] Re: NIF: NLP Interchange Format

2023-10-25 Thread Ada Wan via Corpora
example, to refer to the one and only "meaningful > thing" that is common to the very many (theoretically infinite, practically > probably around 10,000) strings including ağaç, ağacı, ağaca, ağaçlar, > ağacımızdaki, ağaçlandırılabilmesinden, ağaçsızlaşmasını, etc. etc.? How > (and why

[Corpora-List] Re: Lemmas and Lemmatization [was Re: NIF: NLP Interchange Format]

2023-10-25 Thread Ada Wan via Corpora
d why it is imperative that I >> wean myself from using lemmas. >> >> What is it that restricts my freedom to invent the lemma (a non-universal >> construct) AĞAÇ-, for example, to refer to the one and only "meaningful >> thing" that is common to the very many

[Corpora-List] Re: [External] Re: NIF: NLP Interchange Format

2023-10-25 Thread Ada Wan via Corpora
, because that is precisely an example of what I would call a > lemma. > > I never claimed that anything exists beyond the reality of my mind. I only > asked why I am not allowed to talk about things that can be conjugated / > inflected etc. and to use the word "lemma" to r

[Corpora-List] Re: NIF: NLP Interchange Format

2023-10-17 Thread Ada Wan via Corpora
Dear Christian Re your PS: one doesn't need to debate the use/future of lemmatization, though I'd welcome such as part of scholarship. For those experienced in matters in/of Linguistics, it should be clear that lemmatization was simply a cconstruct, a entry-level philological exercise (esp. for

[Corpora-List] Re: NIF: NLP Interchange Format

2023-10-17 Thread Ada Wan via Corpora
reas I would see the actual content of texts as the "source data". Using >> linked data and lemmas allows the bridge to connect via lemmas to LiLa >> data. https://lila-erc.eu/ >> >> Kind regards, >> Hugh >> >> >> >> On Tue, Oct 10, 2023 at 3

[Corpora-List] Re: Lemon and Noncognitive morphology

2023-09-19 Thread Ada Wan via Corpora
to model > data, we rely on existing resources, not inventing them from scratch. So > the model can represent word lists as well as computational lexicons or > many other types of lexicographic data existing out there. > > Best regards, > > Max > > On 18/09/2023 19:29

[Corpora-List] Re: Lemon and Noncognitive morphology

2023-09-19 Thread Ada Wan via Corpora
Dear Hugh An alternative would be to use dictionaries (as in, "{ }" in python) to group characters belonging to the consonant and vowel groups (or at least one of them) and then examine accordingly. This should render more scientific insights on sequences than relying on bigger spans of

[Corpora-List] Re: RANLP 2023 Call for Participation

2023-09-01 Thread Ada Wan via Corpora
[Please ignore if not interested] Hi Anil Thanks for your comments. Just one, perhaps most important, clarification for now: I was/am not denying the existence of metaphysics. If anything, I sometimes think of my work as an instance of "computational phenomenology". Thanks and best Ada On

[Corpora-List] Re: Research Position on Computational Models of Misunderstanding in Germany

2023-08-29 Thread Ada Wan via Corpora
Dear Michael Since I have been kind of "at it" on NLP, I thought to be fair to not miss out on this call, I hope similar calls could find these remarks useful: i. NLP as CHI (Computer-Human Interaction) is fine; ii. some information can be extracted based on text, not all; (there might also be a

[Corpora-List] Re: RANLP 2023 Call for Participation

2023-08-29 Thread Ada Wan via Corpora
ia Corpora wrote: > > Dear all, > > I was shocked to see a vitriolic ad-hominem attack on a colleague posted > to this mailing list. It is entirely inappropriate to post this type of > diatribe against an individual even though someone might disagree with > either the tone or the c

[Corpora-List] Re: RANLP 2023 Call for Participation

2023-08-22 Thread Ada Wan via Corpora
* Thanks and best Ada On Mon, Aug 21, 2023 at 5:39 PM Ada Wan wrote: > Amendment: > In short, there are no symbolic concepts relevant in computing / > computational processing except for those which also align with statistics. > (There are various levels of assumptions/abstr

[Corpora-List] Re: RANLP 2023 Call for Participation

2023-08-21 Thread Ada Wan via Corpora
doing in "symbolic computing" surely deserves a critical re-examination. On Mon, Aug 21, 2023 at 4:48 PM Ada Wan wrote: > Dear Ben, Rodolfo, and Toms > > Please accept that there is a responsibility to science, technology, > engineering, and education (or anything that we

[Corpora-List] Re: RANLP 2023 Call for Participation

2023-08-21 Thread Ada Wan via Corpora
Dear Ben, Rodolfo, and Toms Please accept that there is a responsibility to science, technology, engineering, and education (or anything that we undertake). If you could point out the specific arguments as to which of what I wrote may be problematic to you, perhaps we can have a constructive

[Corpora-List] Re: RANLP 2023 Call for Participation

2023-08-20 Thread Ada Wan via Corpora
Dear RANLP organizers I looked through your program and have a few questions: i. I noticed there is a parallel session for "Sentence-level Representation and Analysis". Would you mind please letting me (or us all on this list) know why "sentence(s)" would be relevant and necessary in computing?

[Corpora-List] Re: Second Call for Papers: The 4th Workshop on Evaluation and Comparison of NLP Systems (Eval4NLP 2023)

2023-08-15 Thread Ada Wan via Corpora
Dear Juri Thanks for your reply. Would you mind please addressing my question(s) at least under [i] more explicitly? That is: Has anyone claimed that input data does not affect results in any way? Will there be any explicit evaluation with data statistics (in combination with textual

[Corpora-List] Re: Postdoc and PhD position in NLP/CL/CSS at GESIS (Cologne) - deadline: Sept 5/6

2023-08-15 Thread Ada Wan via Corpora
wrote: > Dear Ada, > > Thanks a lot for giving me the opportunity to clarify these points, which > are very important, and to do so on the public list! > > On 15. Aug 2023, at 13:44, Ada Wan wrote: > > Dear Gabriella > > I have 2 concerns about your post/project: > >

[Corpora-List] Re: Second Call for Papers: The 4th Workshop on Evaluation and Comparison of NLP Systems (Eval4NLP 2023)

2023-08-15 Thread Ada Wan via Corpora
Dear Eval4NLP organizers Thanks for your effort to keep on evaluating NLP systems. The following formulations/initiatives have come to my attention. I have some questions: i. "** Shared Task ** This year’s version will come with a shared task on explainable evaluation of generated language (MT

[Corpora-List] Re: Postdoc and PhD position in NLP/CL/CSS at GESIS (Cologne) - deadline: Sept 5/6

2023-08-15 Thread Ada Wan via Corpora
Dear Gabriella I have 2 concerns about your post/project: i. I noticed your formulation here in your call "as machine learning approaches which rely on gold standards which average annotators’ perspectives are particularly unsuitable for the highly subjective phenomena tackled in CSS research

[Corpora-List] Re: math and language/text data (continued from thread "Re: Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...")

2023-08-14 Thread Ada Wan via Corpora
ends up shaping the social infrastructure). But there are many parallels in language phenomena. The crux of the matter lies not in language --- that's a message I've been trying to get across. (It took me a while to come to terms with that.) On Mon, Aug 14, 2023 at 9:16 AM Anil Singh w

[Corpora-List] Re: math and language/text data (continued from thread "Re: Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...")

2023-08-11 Thread Ada Wan via Corpora
Just a quick reply before the weekend to some of the points that I thought deserve a short clarification: 1. re linguistic empowerment: yes and no. As I commented on X (formerly Twitter) on 09Aug2023: "[t]here are divergent ways of thinking... but when it comes to language and the social

[Corpora-List] Re: math and language/text data (continued from thread "Re: Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...")

2023-08-08 Thread Ada Wan via Corpora
ge technologies: how to design something more universal, one that is less centered on a particular script (instead of implicitly prompting users to type in ASCII) and that does not require one to use so many alternate function keys like shift, ctrl, alt, alt gr... etc.?] Best Ada On Mon, Aug 7,

[Corpora-List] math and language/text data (continued from thread "Re: Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...")

2023-08-07 Thread Ada Wan via Corpora
[resending since it looked like my reply bounced due to size of thread] -- Forwarded message - From: Ada Wan Date: Mon, Aug 7, 2023 at 3:46 PM Subject: Re: [Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-08-05 Thread Ada Wan via Corpora
nguage being magical, well, that depends on what you mean by >>> magical. To me, it seems it is magical in the same sense as life itself is >>> magical. Nothing more, nothing less. Even computer programming I have been >>> known to call magical in a certain sense. >>> >&

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-08-04 Thread Ada Wan via Corpora
@Toms: for completeness' sake: would you mind please sharing your background? Thanks. On Fri, Aug 4, 2023 at 5:31 PM Ada Wan wrote: > Thanks x2, Ibrtchx. > > On Fri, Aug 4, 2023 at 3:30 AM Albretch Mueller wrote: > >> On 8/3/23, Toms Bergmanis wrote: >> ... >>

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-08-04 Thread Ada Wan via Corpora
Thanks x2, Ibrtchx. On Fri, Aug 4, 2023 at 3:30 AM Albretch Mueller wrote: > On 8/3/23, Toms Bergmanis wrote: > ... > > I, for one, have benefited from Ada's, as well as other member's > suggestions and comments as I hope they have somehow benefited from > mine. > lbrtchx >

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-08-04 Thread Ada Wan via Corpora
thing that > everyone should read before engaging in a debate with you. > > > > For anyone wanting to continue this discussion, I strongly recommend > reading Ada's work, so you have an informed opinion about what evidence she > is referring to. > > Sincerely, > > Toms Bergmanis > -

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-08-03 Thread Ada Wan via Corpora
Kind regards, > Hugh > > On Wed, Aug 2, 2023 at 11:12 AM Ada Wan via Corpora < > corpora@list.elra.info> wrote: > >> Re RML or any "text technologies" leveraging "grammar" (misnomer or not): >> it is not the right time right now to be

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-08-02 Thread Ada Wan via Corpora
concepts are categorically invalid. That is not what I intended to communicate (with all my papers, scientific work, and my comments here). It is an expert opinion/finding that I shared, upon some careful evaluation. On Tue, Aug 1, 2023 at 10:26 PM Albretch Mueller via Corpora < corpora@

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-07-31 Thread Ada Wan via Corpora
Dear Ibrtchx *I. The "grammar"* I was referring to is not exactly the heuristic you wrote about. * One can certainly read and analyze texts/corpora/literature, I don't disagree with that. That having been expressed, here are a couple of points re RML that one should pay heed: i. to what extent

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-07-28 Thread Ada Wan via Corpora
Dear Ibrtchx Re your 1st email (dated Jul 27, 2023, 5:01 AM, UTC+2): i. Re "no grammar": in reality. It's "made up" of (post-hoc) analyses and normative values from language judgment based on (more/less) well-formed data. [Of course, most of us who entered the language space didn't see it as such

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-07-26 Thread Ada Wan via Corpora
ormation that is CONFIDENTIAL. If you are not the intended recipient you > are hereby notified that any dissemination, copy or disclosure of this > communication is strictly prohibited by law. If this message has been > received by mistake, please let us know immediately via e-ma

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-07-26 Thread Ada Wan via Corpora
Dear all The primary reason I got onto this thread has to do with what I sensed might be an attempt to promote certain methodology, one that direly needs some re-evaluation, much like many other in the space of language and computing (and/or CL/NLP, digital humanities... etc.). I know that many

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-07-25 Thread Ada Wan via Corpora
A brief reply: may I invite you to take a look at my work from recent years, e.g. Fairness in Representation and Representation and Bias (all versions are linked here: https://sites.google.com/view/adawan)? There may be a lot to abstract from my findings. But I can imagine your obtaining some

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-07-25 Thread Ada Wan via Corpora
>> made up of matrices, that is at least couples of vectors where the rows are >> represented by embeddings. The question is do matrices represent all needed >> semantic and syntactic properties of a sentence? I doubt it and in fact >> when it comes to deep implicit content

[Corpora-List] Re: Any literature about tensors-based corpora NLP research with actual examples (and homework ;-)) you would suggest? ...

2023-07-25 Thread Ada Wan via Corpora
Dear lbrtchx Yes, indeed, it is possible for a string (or an expression or a lexical item... etc.) to refer to different things based on different contexts. One could refer to it as polysemy (or not). Many fields have shared vocabulary items. Same character or character strings can be used in

[Corpora-List] Re: Six PhD students and one postdoc: Neurosymbolic Models of Language, Vision, and Action - Saarbrücken, Germany

2023-07-16 Thread Ada Wan via Corpora
I have already posted this alert/remark on the ML-news posting. This is for the Corpora-List. -- Forwarded message - From: Ada Wan Date: Fri, Jul 14, 2023 at 8:44 PM Subject: Re: [ML-news] Three PhD students, one postdoc on neurosymbolic models To: Alexander Koller , Dear

[Corpora-List] Re: Counting multiple long (9+) n-grams in corpora: request for approaches

2023-06-27 Thread Ada Wan via Corpora
Hi David What is the reason/purpose for your n-gram analysis or classification task? What software have you been using? Why not just exact string matching by characters? (Take log if length issues go out of hand?) Best Ada On Mon, Jun 26, 2023 at 1:45 PM David Beauchamp via Corpora <

[Corpora-List] Re: Fwd: Deadline extension: 19th Workshop on Multiword Expressions (MWE 2023)

2023-02-11 Thread Ada Wan via Corpora
th the term "expression" without further > qualification is that to my mind it includes any kind of linguistic sign, > including ones like "to pay a visit to my dear aunt Ruth" which can clearly > be interpreted compositionally. So I think we do have to specify &quo

[Corpora-List] Re: Fwd: Deadline extension: 19th Workshop on Multiword Expressions (MWE 2023)

2023-02-10 Thread Ada Wan via Corpora
oo. -- What do you all think? Thanks and best Ada On Fri, Feb 10, 2023 at 10:58 AM Kilian Evang via Corpora < corpora@list.elra.info> wrote: > Forwarded message from Archna below > > -- Forwarded message - > Von: Archna Bhatia > D

[Corpora-List] Re: Fwd: Deadline extension: 19th Workshop on Multiword Expressions (MWE 2023)

2023-02-09 Thread Ada Wan via Corpora
Hi Archna, hi Kilian, hi all Thanks for your replies. TLDR on my part: I'd be fine going with "expressions" (instead of "fixed/idiomatic expressions"). Neither "word" nor "morphology/syntax" (apart from the ordering of elements and/or sequential patterns) is necessary in the analyses of such.

[Corpora-List] Re: Deadline extension: 19th Workshop on Multiword Expressions (MWE 2023)

2023-02-08 Thread Ada Wan via Corpora
ents that are the mainly topics. > > Ken (webmaster retiree) > On 2/8/2023 10:18 AM, Ada Wan via Corpora wrote: > > Hi Kilian > > Hope all has been well. > > I'm surprised that people are still "wording around" nowadays. Some > suggestions: > >

[Corpora-List] Re: Deadline extension: 19th Workshop on Multiword Expressions (MWE 2023)

2023-02-08 Thread Ada Wan via Corpora
Hi Kilian Hope all has been well. I'm surprised that people are still "wording around" nowadays. Some suggestions: 1. Can't we rename "MWEs" to "fixed/idiomatic expressions" instead? One can reformulate these as sequences/strings/expressions of various lengths/vocabs in characters. 2. Also, one

[Corpora-List]Re: Complex Word Identification in French

2022-06-23 Thread Ada Wan
s the desired level of abstraction is the other way around, > arriving at MWE's being even more of a nightmare than poor "words", > whatever they represent. > And sometimes, the best way is to keep using "words" with lots of policing > around what they are, which btw might

[Corpora-List]Re: [EXTERNAL] Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
On Tue, Jun 21, 2022 at 12:14 AM Flor, Michael wrote: > The notion of 'word' has difficulties in linguistics. > But not enough for abandoning it. > > Except we don't need it at all --- for both human or machine processing. > The argument from the paper "Fairness in Representation for

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
word boundaries" involved.) > > Great. So now what? I assume everything I wrote is 100% mainstream, known > to any and every linguist, half of whom could amplify and correct all the > mistakes I've made in the above. Sure, but so what? You can't get rid of > the concept of

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
ble and incisive text covering > the impact of mass language endangerment. The distinguished author explores > issues surrounding the preservation of indigenous languages, ...* > > (ungood w*rds unw*rded to protect the faint of mind against ungood > thinking/processing). &

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
d Edition (formerly called > Dying W*rds: Endangered Languages and What They Have to Tell Us), renowned > scholar Nicholas Evans delivers an accessible and incisive text covering > the impact of mass language endangerment. The distinguished author explores > issues surrounding

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
vering > the impact of mass language endangerment. The distinguished author explores > issues surrounding the preservation of indigenous languages, ...* > > (ungood w*rds unw*rded to protect the faint of mind against ungood > thinking/processing). > > Best, > > DH > >

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
e notion of goodness and badness will be covered by only six words–in > reality, only one word. Don’t you see the beauty of that, Ada?…” > > George Orwell, 1984 > > > > Le 20 juin 2022 à 17:33, Ada Wan a écrit : > > > > Hi Christopher, > > > > It is of the

[Corpora-List]Re: Complex Word Identification in French

2022-06-20 Thread Ada Wan
Hi Christopher, It is of the best interest of the community to discontinue the usage of "word". The term is not only very shaky in its foundation (if any), but it can also effect disparity in performance in computational processing and robustness when human evaluation is involved. Despite the