What is the exact error you're getting? Can you please send me the moses.ini
file you used and an example of your input sentence?
Sent while bumping into things
> On 5 Dec 2013, at 02:44, "Aaron L.-F. Han" wrote:
>
> Hi Hieu,
>
> We want to know whether the tree-to-tree based SMT on Moses can
Hi Hieu,
We want to know whether the tree-to-tree based SMT on Moses can be tuned
like the phrase-based translation?
If the tree-to-tree based SMT can be tuned, what is the tuning command?
We have tried several times using the following settings, but they did not work:
1. use Chinese-English parsed bi
To match a given orthography in Unicode, you can use Unicode Character
Properties, which in Perl regexes looks like /\p{Bengali}/ for example.
See the following link for more explanation:
http://perldoc.perl.org/perlunicode.html#*Scripts*
So an oversimplified example:
echo "ভারত is a vast c
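
For anyone without Perl at hand, here is a rough Python sketch of the same idea. Python's standard `re` module does not support `\p{...}` properties, so this version assumes that checking whether a character's Unicode name starts with "BENGALI" is a good enough stand-in for the script property. The function names `is_bengali` and `bengali_words` are made up for illustration.

```python
import unicodedata


def is_bengali(ch):
    # Assumption: a Unicode character name starting with "BENGALI" marks the
    # character as Bengali script. The "" default avoids a ValueError for
    # code points that have no name.
    return unicodedata.name(ch, "").startswith("BENGALI")


def bengali_words(sentence):
    # Keep whitespace-separated tokens made up entirely of Bengali characters.
    return [tok for tok in sentence.split() if all(is_bengali(c) for c in tok)]


print(bengali_words("ভারত is a vast country"))
```

Tokens that mix scripts or carry attached punctuation are skipped by this sketch, so real text would probably need tokenization first.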
Many thanks for your replies!
One more question concerning these scripts: is there a capability to detach
translation models to free RAM and attach new TMs while the decoder is
running? If not, maybe you can provide a roadmap for me to contribute such
functionality?
Kind regards!
04.12.2013, 1
Hi,
I need a viewer which would display the two sides of a parallel corpus
side-by-side, so that I can check for sanity etc. I remember I used to
have one of them, but it has been some time, so I cannot find it right
now. Could somebody give me a quick suggestion?
best
Lefteris
--
MSc. Inf.
Hi,
I am building the baseline system following the instructions in the Moses manual.
I have installed Moses, GIZA++ and IRSTLM as mentioned in the manual.
The corpus preparation steps (tokenization, ... cleaning) also go well.
However, when I move on to Language Model Training, I have some problems.
I am fol
Hi,
it is possible to specify multiple translation tables
and train different weights for them.
Alternatively, you can merge them into a single
table. See some information about this here:
http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc52
The models cannot be updated while the decoder
Калинин Александр writes:
>
> Hello, everyone!
>
> I have the following question: I have trained a huge translation model
> trained on 1m sentences of news texts. Also I have several minor
> translation models trained on texts of different domains. Are there any
> tools in Moses that can enable m
Hi,
To suggest transliterated words to Moses where the source was not seen
in the training data (otherwise transliterating those words in the
training data would probably be the better way) you can use the XML
markup as described here:
http://www.statmt.org/moses/?n=Moses.AdvancedFeatures#ntoc
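
A minimal sketch of how such marked-up input could be generated, assuming the markup takes the form of an XML element with a `translation` attribute wrapped around the source word. The tag name `x`, the function `mark_up` and the suggestion dictionary are placeholders for illustration; check the linked documentation for the exact markup and for the decoder's `-xml-input` setting.

```python
from xml.sax.saxutils import escape, quoteattr


def mark_up(tokens, suggestions, tag="x"):
    # Wrap every token that has a suggested translation in an XML element;
    # all other tokens are passed through unchanged.
    out = []
    for tok in tokens:
        if tok in suggestions:
            attr = quoteattr(suggestions[tok])  # adds quotes and escapes the value
            out.append(f"<{tag} translation={attr}>{escape(tok)}</{tag}>")
        else:
            out.append(tok)
    return " ".join(out)


# Hypothetical transliteration table for words unseen in training.
line = mark_up("ভারত is a vast country".split(), {"ভারত": "Bharat"})
print(line)
```

The resulting line would then be passed to the decoder as ordinary input, with XML input handling switched on.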
@Thomas, I don't think his intention was to search for a given UTF-8 string;
it was more to identify strings in a different language.
@Pranjal, when you say Unicode words, understand that everything (even the
English words) is encoded in UTF-8. It's more like you want to pick the
Bengali (read a
Hello, everyone!
I have the following question: I have trained a huge translation model
on 1M sentences of news texts. Also, I have several minor translation models
trained on texts of different domains. Are there any tools in Moses that can
enable merging translation models? Is it poss
Hi,
echo "ভারত is a vast country" | perl -pe '/(ভারত)/; print $1."\n"'
if you want to replace it with something:
echo "ভারত is a vast country" | perl -pe 's/ভারত/something/g'
But this is normally not the list for Perl questions...
Best,
Thomas
On 04/12/13 10:55, Pranjal Das wrote:
> can
Can anyone help with a Perl script to extract Unicode words from a
sentence? For example, I want to extract the word ভারত from the sentence "ভারত
is a vast country".
*Pranjal Das*
Department of Information Technology,
Institute of Science and Technology,
Gauhati University, Guwahati, Assam
Phone- +91-83
Hi,
How can I extract the Unicode word भारत from the sentence "भारत is a
vast country" and replace it in the sentence, giving "Bharat is a vast
country", using a Perl program?
Regards,
*Kalyanee Kanchan Baruah*
Department of Information Technology,
Institute of Science and Technology,
G
14 matches