Hi, For simple corpus exploration, I agree that it would be a better default. In our case, we’re using our own pipeline, to be coherent with higher-level annotations (such as POS) - nothing wrong with the Seeder
Best, Hugues > Le 25 févr. 2021 à 14:41, Peter Klügl <[email protected]> a écrit : > > Hi, > > > I am thinking about changing the default value of the seeder parameter > in the RutaEngine from DefaultSeeder to TextSeeder. I think TextSeeder > (no MARKUP annotations) is a better default value in most use cases. > > > Are there opinions on that? > > > Best, > > > Peter > > > -- > Dr. Peter Klügl > Head of Text Mining/Machine Learning > > Averbis GmbH > Salzstr. 15 > 79098 Freiburg > Germany > > Fon: +49 761 708 394 0 > Fax: +49 761 708 394 10 > Email: [email protected] > Web: https://averbis.com > > Headquarters: Freiburg im Breisgau > Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080 > Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó >
