Re: [INDOLOGY] Search interface for the GRETIL Corpus

Arlo Griffiths via INDOLOGY Wed, 07 May 2025 07:52:31 -0700

Dear Claudius,

Thanks a lot for this initiative. Allow me to ask if it is also possible to 
resume absorbing texts into the same corpus?


Now that its Göttingen host no longer seems to be interested in curating it, 
why not store all files on github or gitlab and initiate a collective INDOLOGY 
endeavor toward curating (txt > xml conversion) and expanding the corpus?

I write these words without having a full understanding of everything that 
would be required, but I'd certainly be interested in contributing.

Best wishes,

Arlo Griffiths
EFEO








________________________________
From: INDOLOGY <[email protected]> on behalf of Claudius 
Teodorescu via INDOLOGY <[email protected]>
Sent: Tuesday, April 22, 2025 9:00 AM
To: Indology <[email protected]>
Subject: [INDOLOGY] Search interface for the GRETIL Corpus

Dear all,

During the last months, I managed to set a search interface for the texts of 
the GRETIL Corpus, located at [1]. The interface is published as a static 
website, with a static full-text index and a static search engine, which 
execute the search in the browser, without the need for a server.

In order to convert the files to HTML format, which is used to display them in 
the search interface, I had to make some small updates to the XML files of the 
corpus. These changes are documented in [2]. As one expects, there is still 
work to be done with the XML files of the corpus.

Please let me know if you find any bugs with the search interface.

Best regards,
Claudius Teodorescu

[1] https://claudius-teodorescu.gitlab.io/gretil-corpus-site/
[2] https://gitlab.com/claudius-teodorescu/gretil-corpus-data

_______________________________________________
INDOLOGY mailing list
[email protected]
https://list.indology.info/mailman/listinfo/indology

Re: [INDOLOGY] Search interface for the GRETIL Corpus

Reply via email to