Dear Johannes
Re "CreoleVal":
at this point, it's more like a "one shouldn't" as opposed to whether "one
can".
The following is what I wrote to the SIGTYP, I think the message would be
similar for your initiative:
"""
---------- Forwarded message ---
Paterson III
wrote:
> @Ada
>
> >What do we do with students/graduates who were fed archaic ideals?
> You give them full professorships. ;-)
>
> - Hugh
>
> On Wed, Oct 25, 2023 at 7:29 AM Ada Wan via Corpora <
> corpora@list.elra.info> wrote:
>
>> D
example, to refer to the one and only "meaningful
> thing" that is common to the very many (theoretically infinite, practically
> probably around 10,000) strings including ağaç, ağacı, ağaca, ağaçlar,
> ağacımızdaki, ağaçlandırılabilmesinden, ağaçsızlaşmasını, etc. etc.? How
> (and why
d why it is imperative that I
>> wean myself from using lemmas.
>>
>> What is it that restricts my freedom to invent the lemma (a non-universal
>> construct) AĞAÇ-, for example, to refer to the one and only "meaningful
>> thing" that is common to the very many
, because that is precisely an example of what I would call a
> lemma.
>
> I never claimed that anything exists beyond the reality of my mind. I only
> asked why I am not allowed to talk about things that can be conjugated /
> inflected etc. and to use the word "lemma" to r
Dear Christian
Re your PS:
one doesn't need to debate the use/future of lemmatization, though I'd
welcome such as part of scholarship. For those experienced in matters in/of
Linguistics, it should be clear that lemmatization was simply a cconstruct,
a entry-level philological exercise (esp. for
reas I would see the actual content of texts as the "source data". Using
>> linked data and lemmas allows the bridge to connect via lemmas to LiLa
>> data. https://lila-erc.eu/
>>
>> Kind regards,
>> Hugh
>>
>>
>>
>> On Tue, Oct 10, 2023 at 3
to model
> data, we rely on existing resources, not inventing them from scratch. So
> the model can represent word lists as well as computational lexicons or
> many other types of lexicographic data existing out there.
>
> Best regards,
>
> Max
>
> On 18/09/2023 19:29
Dear Hugh
An alternative would be to use dictionaries (as in, "{ }" in python) to
group characters belonging to the consonant and vowel groups (or at least
one of them) and then examine accordingly. This should render more
scientific insights on sequences than relying on bigger spans of
[Please ignore if not interested]
Hi Anil
Thanks for your comments.
Just one, perhaps most important, clarification for now:
I was/am not denying the existence of metaphysics. If anything, I sometimes
think of my work as an instance of "computational phenomenology".
Thanks and best
Ada
On
Dear Michael
Since I have been kind of "at it" on NLP, I thought to be fair to not miss
out on this call, I hope similar calls could find these remarks useful:
i. NLP as CHI (Computer-Human Interaction) is fine;
ii. some information can be extracted based on text, not all; (there might
also be a
ia Corpora wrote:
>
> Dear all,
>
> I was shocked to see a vitriolic ad-hominem attack on a colleague posted
> to this mailing list. It is entirely inappropriate to post this type of
> diatribe against an individual even though someone might disagree with
> either the tone or the c
*
Thanks and best
Ada
On Mon, Aug 21, 2023 at 5:39 PM Ada Wan wrote:
> Amendment:
> In short, there are no symbolic concepts relevant in computing /
> computational processing except for those which also align with statistics.
> (There are various levels of assumptions/abstr
doing in "symbolic computing" surely deserves a critical re-examination.
On Mon, Aug 21, 2023 at 4:48 PM Ada Wan wrote:
> Dear Ben, Rodolfo, and Toms
>
> Please accept that there is a responsibility to science, technology,
> engineering, and education (or anything that we
Dear Ben, Rodolfo, and Toms
Please accept that there is a responsibility to science, technology,
engineering, and education (or anything that we undertake).
If you could point out the specific arguments as to which of what I wrote
may be problematic to you, perhaps we can have a constructive
Dear RANLP organizers
I looked through your program and have a few questions:
i. I noticed there is a parallel session for "Sentence-level Representation
and Analysis". Would you mind please letting me (or us all on this list)
know why "sentence(s)" would be relevant and necessary in computing?
Dear Juri
Thanks for your reply.
Would you mind please addressing my question(s) at least under [i] more
explicitly? That is:
Has anyone claimed that input data does not affect results in any way?
Will there be any explicit evaluation with data statistics (in combination
with textual
wrote:
> Dear Ada,
>
> Thanks a lot for giving me the opportunity to clarify these points, which
> are very important, and to do so on the public list!
>
> On 15. Aug 2023, at 13:44, Ada Wan wrote:
>
> Dear Gabriella
>
> I have 2 concerns about your post/project:
>
>
Dear Eval4NLP organizers
Thanks for your effort to keep on evaluating NLP systems.
The following formulations/initiatives have come to my attention. I have
some questions:
i. "** Shared Task ** This year’s version will come with a shared task on
explainable evaluation of generated language (MT
Dear Gabriella
I have 2 concerns about your post/project:
i. I noticed your formulation here in your call "as machine learning
approaches which rely on gold standards which average annotators’
perspectives are particularly unsuitable for the highly subjective
phenomena tackled in CSS research
ends up
shaping the social infrastructure). But there are many parallels in
language phenomena. The crux of the matter lies not in language --- that's
a message I've been trying to get across. (It took me a while to come to
terms with that.)
On Mon, Aug 14, 2023 at 9:16 AM Anil Singh w
Just a quick reply before the weekend to some of the points that I thought
deserve a short clarification:
1. re linguistic empowerment: yes and no. As I commented on X (formerly
Twitter) on 09Aug2023: "[t]here are divergent ways of thinking... but when
it comes to language and the social
ge
technologies: how to design something more universal, one that is less
centered on a particular script (instead of implicitly prompting users to
type in ASCII) and that does not require one to use so many alternate
function keys like shift, ctrl, alt, alt gr... etc.?]
Best
Ada
On Mon, Aug 7,
[resending since it looked like my reply bounced due to size of thread]
-- Forwarded message -
From: Ada Wan
Date: Mon, Aug 7, 2023 at 3:46 PM
Subject: Re: [Corpora-List] Re: Any literature about tensors-based corpora
NLP research with actual examples (and homework ;-)) you would
nguage being magical, well, that depends on what you mean by
>>> magical. To me, it seems it is magical in the same sense as life itself is
>>> magical. Nothing more, nothing less. Even computer programming I have been
>>> known to call magical in a certain sense.
>>>
>&
@Toms:
for completeness' sake: would you mind please sharing your background?
Thanks.
On Fri, Aug 4, 2023 at 5:31 PM Ada Wan wrote:
> Thanks x2, Ibrtchx.
>
> On Fri, Aug 4, 2023 at 3:30 AM Albretch Mueller wrote:
>
>> On 8/3/23, Toms Bergmanis wrote:
>> ...
>>
Thanks x2, Ibrtchx.
On Fri, Aug 4, 2023 at 3:30 AM Albretch Mueller wrote:
> On 8/3/23, Toms Bergmanis wrote:
> ...
>
> I, for one, have benefited from Ada's, as well as other member's
> suggestions and comments as I hope they have somehow benefited from
> mine.
> lbrtchx
>
thing that
> everyone should read before engaging in a debate with you.
>
>
>
> For anyone wanting to continue this discussion, I strongly recommend
> reading Ada's work, so you have an informed opinion about what evidence she
> is referring to.
>
> Sincerely,
>
> Toms Bergmanis
> -
Kind regards,
> Hugh
>
> On Wed, Aug 2, 2023 at 11:12 AM Ada Wan via Corpora <
> corpora@list.elra.info> wrote:
>
>> Re RML or any "text technologies" leveraging "grammar" (misnomer or not):
>> it is not the right time right now to be
concepts
are categorically invalid. That is not what I intended to communicate (with
all my papers, scientific work, and my comments here). It is an expert
opinion/finding that I shared, upon some careful evaluation.
On Tue, Aug 1, 2023 at 10:26 PM Albretch Mueller via Corpora <
corpora@
Dear Ibrtchx
*I. The "grammar"* I was referring to is not exactly the heuristic you
wrote about. *
One can certainly read and analyze texts/corpora/literature, I don't
disagree with that.
That having been expressed, here are a couple of points re RML that one
should pay heed:
i. to what extent
Dear Ibrtchx
Re your 1st email (dated Jul 27, 2023, 5:01 AM, UTC+2):
i. Re "no grammar": in reality. It's "made up" of (post-hoc) analyses and
normative values from language judgment based on (more/less) well-formed
data. [Of course, most of us who entered the language space didn't see it
as such
ormation that is CONFIDENTIAL. If you are not the intended recipient you
> are hereby notified that any dissemination, copy or disclosure of this
> communication is strictly prohibited by law. If this message has been
> received by mistake, please let us know immediately via e-ma
Dear all
The primary reason I got onto this thread has to do with what I sensed
might be an attempt to promote certain methodology, one that direly needs
some re-evaluation, much like many other in the space of language and
computing (and/or CL/NLP, digital humanities... etc.). I know that many
A brief reply:
may I invite you to take a look at my work from recent years, e.g. Fairness
in Representation and Representation and Bias (all versions are linked
here: https://sites.google.com/view/adawan)?
There may be a lot to abstract from my findings. But I can imagine your
obtaining some
>> made up of matrices, that is at least couples of vectors where the rows are
>> represented by embeddings. The question is do matrices represent all needed
>> semantic and syntactic properties of a sentence? I doubt it and in fact
>> when it comes to deep implicit content
Dear lbrtchx
Yes, indeed, it is possible for a string (or an expression or a lexical
item... etc.) to refer to different things based on different contexts. One
could refer to it as polysemy (or not).
Many fields have shared vocabulary items. Same character or character
strings can be used in
I have already posted this alert/remark on the ML-news posting. This is for
the Corpora-List.
-- Forwarded message -
From: Ada Wan
Date: Fri, Jul 14, 2023 at 8:44 PM
Subject: Re: [ML-news] Three PhD students, one postdoc on neurosymbolic
models
To: Alexander Koller ,
Dear
Hi David
What is the reason/purpose for your n-gram analysis or classification task?
What software have you been using?
Why not just exact string matching by characters? (Take log if length
issues go out of hand?)
Best
Ada
On Mon, Jun 26, 2023 at 1:45 PM David Beauchamp via Corpora <
th the term "expression" without further
> qualification is that to my mind it includes any kind of linguistic sign,
> including ones like "to pay a visit to my dear aunt Ruth" which can clearly
> be interpreted compositionally. So I think we do have to specify &quo
oo.
--
What do you all think?
Thanks and best
Ada
On Fri, Feb 10, 2023 at 10:58 AM Kilian Evang via Corpora <
corpora@list.elra.info> wrote:
> Forwarded message from Archna below
>
> -- Forwarded message -
> Von: Archna Bhatia
> D
Hi Archna, hi Kilian, hi all
Thanks for your replies.
TLDR on my part: I'd be fine going with "expressions" (instead of
"fixed/idiomatic expressions"). Neither "word" nor "morphology/syntax"
(apart from the ordering of elements and/or sequential patterns) is
necessary in the analyses of such.
ents that are the mainly topics.
>
> Ken (webmaster retiree)
> On 2/8/2023 10:18 AM, Ada Wan via Corpora wrote:
>
> Hi Kilian
>
> Hope all has been well.
>
> I'm surprised that people are still "wording around" nowadays. Some
> suggestions:
>
>
Hi Kilian
Hope all has been well.
I'm surprised that people are still "wording around" nowadays. Some
suggestions:
1. Can't we rename "MWEs" to "fixed/idiomatic expressions" instead? One can
reformulate these as sequences/strings/expressions of various
lengths/vocabs in characters.
2. Also, one
s the desired level of abstraction is the other way around,
> arriving at MWE's being even more of a nightmare than poor "words",
> whatever they represent.
> And sometimes, the best way is to keep using "words" with lots of policing
> around what they are, which btw might
On Tue, Jun 21, 2022 at 12:14 AM Flor, Michael wrote:
> The notion of 'word' has difficulties in linguistics.
> But not enough for abandoning it.
>
> Except we don't need it at all --- for both human or machine processing.
> The argument from the paper "Fairness in Representation for
word boundaries" involved.)
>
> Great. So now what? I assume everything I wrote is 100% mainstream, known
> to any and every linguist, half of whom could amplify and correct all the
> mistakes I've made in the above. Sure, but so what? You can't get rid of
> the concept of
ble and incisive text covering
> the impact of mass language endangerment. The distinguished author explores
> issues surrounding the preservation of indigenous languages, ...*
>
> (ungood w*rds unw*rded to protect the faint of mind against ungood
> thinking/processing).
&
d Edition (formerly called
> Dying W*rds: Endangered Languages and What They Have to Tell Us), renowned
> scholar Nicholas Evans delivers an accessible and incisive text covering
> the impact of mass language endangerment. The distinguished author explores
> issues surrounding
vering
> the impact of mass language endangerment. The distinguished author explores
> issues surrounding the preservation of indigenous languages, ...*
>
> (ungood w*rds unw*rded to protect the faint of mind against ungood
> thinking/processing).
>
> Best,
>
> DH
>
>
e notion of goodness and badness will be covered by only six words–in
> reality, only one word. Don’t you see the beauty of that, Ada?…”
>
> George Orwell, 1984
>
>
> > Le 20 juin 2022 à 17:33, Ada Wan a écrit :
> >
> > Hi Christopher,
> >
> > It is of the
Hi Christopher,
It is of the best interest of the community to discontinue the usage of
"word". The term is not only very shaky in its foundation (if any), but it
can also effect disparity in performance in computational processing and
robustness when human evaluation is involved.
Despite the
52 matches
Mail list logo