Thanks Peter, I will await your confirmation of the fix, but I guess we will then stick with 2.6.1 until the next Ruta release :)
Cheers, Mario > On 20 Sep 2019, at 18:09 , Peter Klügl <[email protected]> wrote: > > Hi Mario, > > > I did not have the chance to have a look at your example yet... > > > Most likely, this problem is already fixed in the current trunk, but I > was not able to find the time for a new release. In 2.7.0, there was a > small modification in the lexer rules for the seeding, which had > unfortunately some unintended side effects in the generated code > especially with unusual unicode characters. I'll try to verify that with > your example the next days. > > > Best, > > > Peter > > Am 19.09.2019 um 12:35 schrieb Mario Juric: >> Hi Peter, >> >> After upgrading to Ruta 2.7.0 a while ago we started getting some >> errors from the SeedLexer, which we didn’t have before. It appears >> related to odd unicode characters that we haven’t cleaned properly >> upstream, but it is consumed by the previous version 2.6.1 where our >> pipeline completes without error. I attached a small sample program >> with a dummy ruta script to reproduce it. >> >> Which version has the correct behaviour in such cases? 2.7.0 or 2.6.1? >> >> >> Cheers, >> Mario >> >> >> >> >> >> >> >> >> >> >> >> >> >> > -- > Dr. Peter Klügl > R&D Text Mining/Machine Learning > > Averbis GmbH > Salzstr. 15 > 79098 Freiburg > Germany > > Fon: +49 761 708 394 0 > Fax: +49 761 708 394 10 > Email: [email protected] > Web: https://averbis.com > > Headquarters: Freiburg im Breisgau > Register Court: Amtsgericht Freiburg im Breisgau, HRB 701080 > Managing Directors: Dr. med. Philipp Daumke, Dr. Kornél Markó >
