Re: Russian Language Model for Joshua
I was able to download it thanks! ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Director, Information Retrieval and Data Science Group (IRDS) Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA WWW: http://irds.usc.edu/ ++ On 7/17/16, 5:40 PM, "Matt Post" <p...@cs.jhu.edu> wrote: >I don't mind hosting it, and JHU hasn't complained, but it's an ugly URL. > >matt > > >> On Jul 16, 2016, at 5:45 PM, Mcgibbney, Lewis J (398M) >> <lewis.j.mcgibb...@jpl.nasa.gov> wrote: >> >> Can you make this public for good? Or is it the size which is the issue? >> Is this build using master branch Matt? I am having issues building models >> with masterŠ I¹ll post my issues on another thread. >> >> Dr. Lewis John McGibbney Ph.D., B.Sc. >> Data Scientist II >> Computer Science for Data Intensive Applications Group 398M >> Jet Propulsion Laboratory >> California Institute of Technology >> 4800 Oak Grove Drive >> Pasadena, California 91109-8099 >> Mail Stop : 158-256C >> Tel: (+1) (818)-393-7402 >> Cell: (+1) (626)-487-3476 >> Fax: (+1) (818)-393-1190 >> Email: lewis.j.mcgibb...@jpl.nasa.gov >> >> >> >> Dare Mighty Things >> >> >> >> >> >> >> >> >> >> >> >> On 7/16/16, 1:09 PM, "Matt Post" <p...@cs.jhu.edu> wrote: >> >>> Done: >>> >>> http://cs.jhu.edu/~post/tmp/ru.kenlm >>> 4106251755 bytes, sha1sum: 5c894e24dafa42bc44a5bb6822812d6234eda791 >>> >>> Let me know when you have it so I can delete it. >>> >>> matt >>> >>> >>>> On Jul 15, 2016, at 4:42 PM, Matt Post <p...@cs.jhu.edu> wrote: >>>> >>>> All right, started trying to recompile. If you have a machine with > >>>> 256 GB of memory, it might be more efficient for me to give you the raw >>>> ARPA file and for you to compile it. We'll see how it goes. Ping me in a >>>> day if you don't hear from me. >>>> >>>> matt >>>> >>>> >>>>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) >>>>> <chris.a.mattm...@jpl.nasa.gov> wrote: >>>>> >>>>> Yes please! :) >>>>> >>>>> Sent from my iPhone >>>>> >>>>>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: >>>>>> >>>>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM >>>>>> compiles of it failed in the past, but I'll try again. I expect it to >>>>>> be about 8 GB when that's done. Do you want it? >>>>>> >>>>>> matt >>>>>> >>>>>> >>>>>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >>>>>>> <chris.a.mattm...@jpl.nasa.gov> wrote: >>>>>>> >>>>>>> Hey Folks, >>>>>>> >>>>>>> Anyone have a Russian Language Model for Joshua? Lewis was working on >>>>>>> one, not sure if he has it but just broadening the question. >>>>>>> >>>>>>> Cheers, >>>>>>> Chris >>>>>>> >>>>>>> ++ >>>>>>> Chris Mattmann, Ph.D. >>>>>>> Chief Architect >>>>>>> Instrument Software and Science Data Systems Section (398) >>>>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>>>>>> Office: 168-519, Mailstop: 168-527 >>>>>>> Email: chris.a.mattm...@nasa.gov >>>>>>> WWW: http://sunset.usc.edu/~mattmann/ >>>>>>> ++ >>>>>>> Director, Information Retrieval and Data Science Group (IRDS) >>>>>>> Adjunct Associate Professor, Computer Science Department >>>>>>> University of Southern California, Los Angeles, CA 90089 USA >>>>>>> WWW: http://irds.usc.edu/ >>>>>>> ++ >>>>>> >>>> >>> >> >
Re: Russian Language Model for Joshua
HTTP resume exists for a reason. If you ask me nicely I'll post it to a US S3 bucket next week! :P -- Director Meteorite.bi - Saiku Analytics Founder Tel: +44(0)5603641316 (Thanks to the Saiku community we reached our Kickstart <http://kickstarter.com/projects/2117053714/saiku-reporting-interactive-report-designer/> goal, but you can always help by sponsoring the project <http://www.meteorite.bi/products/saiku/sponsorship>) On 17 July 2016 at 23:06, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Thanks Tom and Matt. I’m downloading now, but plane WiFi sucks so > I may need to restart at some point > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Director, Information Retrieval and Data Science Group (IRDS) > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++ > > > > > > > > > > > On 7/16/16, 3:16 PM, "Tom Barber" <t...@analytical-labs.com> wrote: > > >I can host it: http://meteorite.bi/downloads/ru.kenlm > > > >Tom > > > >-- > > > >Director Meteorite.bi - Saiku Analytics Founder > >Tel: +44(0)5603641316 > > > >(Thanks to the Saiku community we reached our Kickstart > >< > http://kickstarter.com/projects/2117053714/saiku-reporting-interactive-report-designer/ > > > >goal, but you can always help by sponsoring the project > ><http://www.meteorite.bi/products/saiku/sponsorship>) > > > >On 16 July 2016 at 22:45, Mcgibbney, Lewis J (398M) < > >lewis.j.mcgibb...@jpl.nasa.gov> wrote: > > > >> Can you make this public for good? Or is it the size which is the issue? > >> Is this build using master branch Matt? I am having issues building > models > >> with masterŠ I¹ll post my issues on another thread. > >> > >> Dr. Lewis John McGibbney Ph.D., B.Sc. > >> Data Scientist II > >> Computer Science for Data Intensive Applications Group 398M > >> Jet Propulsion Laboratory > >> California Institute of Technology > >> 4800 Oak Grove Drive > >> Pasadena, California 91109-8099 > >> Mail Stop : 158-256C > >> Tel: (+1) (818)-393-7402 > >> Cell: (+1) (626)-487-3476 > >> Fax: (+1) (818)-393-1190 > >> Email: lewis.j.mcgibb...@jpl.nasa.gov > >> > >> > >> > >> Dare Mighty Things > >> > >> > >> > >> > >> > >> > >> > >> > >> > >> > >> > >> On 7/16/16, 1:09 PM, "Matt Post" <p...@cs.jhu.edu> wrote: > >> > >> >Done: > >> > > >> > http://cs.jhu.edu/~post/tmp/ru.kenlm > >> > 4106251755 bytes, sha1sum: > 5c894e24dafa42bc44a5bb6822812d6234eda791 > >> > > >> >Let me know when you have it so I can delete it. > >> > > >> >matt > >> > > >> > > >> >> On Jul 15, 2016, at 4:42 PM, Matt Post <p...@cs.jhu.edu> wrote: > >> >> > >> >> All right, started trying to recompile. If you have a machine with > > >> >>256 GB of memory, it might be more efficient for me to give you the > raw > >> >>ARPA file and for you to compile it. We'll see how it goes. Ping me > in a > >> >>day if you don't hear from me. > >> >> > >> >> matt > >> >> > >> >> > >> >>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) > >> >>><chris.a.mattm...@jpl.nasa.gov> wrote: > >> >>> > >> >>> Yes please! :) > >> >>> > >> >>> Sent from my iPhone > >> >>> > >> >>>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: > >> >>>> > >> >>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM > >> >>>>compiles of it failed in the past, but I'll try again. I expect it > to > >> >>>&g
Re: Russian Language Model for Joshua
I can host it: http://meteorite.bi/downloads/ru.kenlm Tom -- Director Meteorite.bi - Saiku Analytics Founder Tel: +44(0)5603641316 (Thanks to the Saiku community we reached our Kickstart <http://kickstarter.com/projects/2117053714/saiku-reporting-interactive-report-designer/> goal, but you can always help by sponsoring the project <http://www.meteorite.bi/products/saiku/sponsorship>) On 16 July 2016 at 22:45, Mcgibbney, Lewis J (398M) < lewis.j.mcgibb...@jpl.nasa.gov> wrote: > Can you make this public for good? Or is it the size which is the issue? > Is this build using master branch Matt? I am having issues building models > with masterŠ I¹ll post my issues on another thread. > > Dr. Lewis John McGibbney Ph.D., B.Sc. > Data Scientist II > Computer Science for Data Intensive Applications Group 398M > Jet Propulsion Laboratory > California Institute of Technology > 4800 Oak Grove Drive > Pasadena, California 91109-8099 > Mail Stop : 158-256C > Tel: (+1) (818)-393-7402 > Cell: (+1) (626)-487-3476 > Fax: (+1) (818)-393-1190 > Email: lewis.j.mcgibb...@jpl.nasa.gov > > > > Dare Mighty Things > > > > > > > > > > > > On 7/16/16, 1:09 PM, "Matt Post" <p...@cs.jhu.edu> wrote: > > >Done: > > > > http://cs.jhu.edu/~post/tmp/ru.kenlm > > 4106251755 bytes, sha1sum: 5c894e24dafa42bc44a5bb6822812d6234eda791 > > > >Let me know when you have it so I can delete it. > > > >matt > > > > > >> On Jul 15, 2016, at 4:42 PM, Matt Post <p...@cs.jhu.edu> wrote: > >> > >> All right, started trying to recompile. If you have a machine with > > >>256 GB of memory, it might be more efficient for me to give you the raw > >>ARPA file and for you to compile it. We'll see how it goes. Ping me in a > >>day if you don't hear from me. > >> > >> matt > >> > >> > >>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) > >>><chris.a.mattm...@jpl.nasa.gov> wrote: > >>> > >>> Yes please! :) > >>> > >>> Sent from my iPhone > >>> > >>>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: > >>>> > >>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM > >>>>compiles of it failed in the past, but I'll try again. I expect it to > >>>>be about 8 GB when that's done. Do you want it? > >>>> > >>>> matt > >>>> > >>>> > >>>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) > >>>>><chris.a.mattm...@jpl.nasa.gov> wrote: > >>>>> > >>>>> Hey Folks, > >>>>> > >>>>> Anyone have a Russian Language Model for Joshua? Lewis was working on > >>>>> one, not sure if he has it but just broadening the question. > >>>>> > >>>>> Cheers, > >>>>> Chris > >>>>> > >>>>> ++ > >>>>> Chris Mattmann, Ph.D. > >>>>> Chief Architect > >>>>> Instrument Software and Science Data Systems Section (398) > >>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >>>>> Office: 168-519, Mailstop: 168-527 > >>>>> Email: chris.a.mattm...@nasa.gov > >>>>> WWW: http://sunset.usc.edu/~mattmann/ > >>>>> ++ > >>>>> Director, Information Retrieval and Data Science Group (IRDS) > >>>>> Adjunct Associate Professor, Computer Science Department > >>>>> University of Southern California, Los Angeles, CA 90089 USA > >>>>> WWW: http://irds.usc.edu/ > >>>>> ++ > >>>> > >> > > > >
Re: Russian Language Model for Joshua
Can you make this public for good? Or is it the size which is the issue? Is this build using master branch Matt? I am having issues building models with masterŠ I¹ll post my issues on another thread. Dr. Lewis John McGibbney Ph.D., B.Sc. Data Scientist II Computer Science for Data Intensive Applications Group 398M Jet Propulsion Laboratory California Institute of Technology 4800 Oak Grove Drive Pasadena, California 91109-8099 Mail Stop : 158-256C Tel: (+1) (818)-393-7402 Cell: (+1) (626)-487-3476 Fax: (+1) (818)-393-1190 Email: lewis.j.mcgibb...@jpl.nasa.gov Dare Mighty Things On 7/16/16, 1:09 PM, "Matt Post" <p...@cs.jhu.edu> wrote: >Done: > > http://cs.jhu.edu/~post/tmp/ru.kenlm > 4106251755 bytes, sha1sum: 5c894e24dafa42bc44a5bb6822812d6234eda791 > >Let me know when you have it so I can delete it. > >matt > > >> On Jul 15, 2016, at 4:42 PM, Matt Post <p...@cs.jhu.edu> wrote: >> >> All right, started trying to recompile. If you have a machine with > >>256 GB of memory, it might be more efficient for me to give you the raw >>ARPA file and for you to compile it. We'll see how it goes. Ping me in a >>day if you don't hear from me. >> >> matt >> >> >>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) >>><chris.a.mattm...@jpl.nasa.gov> wrote: >>> >>> Yes please! :) >>> >>> Sent from my iPhone >>> >>>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: >>>> >>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM >>>>compiles of it failed in the past, but I'll try again. I expect it to >>>>be about 8 GB when that's done. Do you want it? >>>> >>>> matt >>>> >>>> >>>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >>>>><chris.a.mattm...@jpl.nasa.gov> wrote: >>>>> >>>>> Hey Folks, >>>>> >>>>> Anyone have a Russian Language Model for Joshua? Lewis was working on >>>>> one, not sure if he has it but just broadening the question. >>>>> >>>>> Cheers, >>>>> Chris >>>>> >>>>> ++ >>>>> Chris Mattmann, Ph.D. >>>>> Chief Architect >>>>> Instrument Software and Science Data Systems Section (398) >>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>>>> Office: 168-519, Mailstop: 168-527 >>>>> Email: chris.a.mattm...@nasa.gov >>>>> WWW: http://sunset.usc.edu/~mattmann/ >>>>> ++ >>>>> Director, Information Retrieval and Data Science Group (IRDS) >>>>> Adjunct Associate Professor, Computer Science Department >>>>> University of Southern California, Los Angeles, CA 90089 USA >>>>> WWW: http://irds.usc.edu/ >>>>> ++ >>>> >> >
Re: Russian Language Model for Joshua
Done: http://cs.jhu.edu/~post/tmp/ru.kenlm 4106251755 bytes, sha1sum: 5c894e24dafa42bc44a5bb6822812d6234eda791 Let me know when you have it so I can delete it. matt > On Jul 15, 2016, at 4:42 PM, Matt Post <p...@cs.jhu.edu> wrote: > > All right, started trying to recompile. If you have a machine with > 256 GB > of memory, it might be more efficient for me to give you the raw ARPA file > and for you to compile it. We'll see how it goes. Ping me in a day if you > don't hear from me. > > matt > > >> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) >> <chris.a.mattm...@jpl.nasa.gov> wrote: >> >> Yes please! :) >> >> Sent from my iPhone >> >>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: >>> >>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM >>> compiles of it failed in the past, but I'll try again. I expect it to be >>> about 8 GB when that's done. Do you want it? >>> >>> matt >>> >>> >>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >>>> <chris.a.mattm...@jpl.nasa.gov> wrote: >>>> >>>> Hey Folks, >>>> >>>> Anyone have a Russian Language Model for Joshua? Lewis was working on >>>> one, not sure if he has it but just broadening the question. >>>> >>>> Cheers, >>>> Chris >>>> >>>> ++ >>>> Chris Mattmann, Ph.D. >>>> Chief Architect >>>> Instrument Software and Science Data Systems Section (398) >>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>>> Office: 168-519, Mailstop: 168-527 >>>> Email: chris.a.mattm...@nasa.gov >>>> WWW: http://sunset.usc.edu/~mattmann/ >>>> ++ >>>> Director, Information Retrieval and Data Science Group (IRDS) >>>> Adjunct Associate Professor, Computer Science Department >>>> University of Southern California, Los Angeles, CA 90089 USA >>>> WWW: http://irds.usc.edu/ >>>> ++ >>> >
Re: Russian Language Model for Joshua
no worries I got it packed. will email later tonight. matt (from my phone) > On Jul 15, 2016, at 6:32 PM, Mattmann, Chris A (3980) > <chris.a.mattm...@jpl.nasa.gov> wrote: > > Will do. > > Adding Paul Zimdars - do we have an Amazon machine that has > 256GB > of memory? How much would that cost? > > Cheers, > Chris > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Director, Information Retrieval and Data Science Group (IRDS) > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++ > > > > > > > > > > >> On 7/15/16, 1:42 PM, "Matt Post" <p...@cs.jhu.edu> wrote: >> >> All right, started trying to recompile. If you have a machine with > 256 GB >> of memory, it might be more efficient for me to give you the raw ARPA file >> and for you to compile it. We'll see how it goes. Ping me in a day if you >> don't hear from me. >> >> matt >> >> >>> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) >>> <chris.a.mattm...@jpl.nasa.gov> wrote: >>> >>> Yes please! :) >>> >>> Sent from my iPhone >>> >>>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: >>>> >>>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM >>>> compiles of it failed in the past, but I'll try again. I expect it to be >>>> about 8 GB when that's done. Do you want it? >>>> >>>> matt >>>> >>>> >>>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >>>>> <chris.a.mattm...@jpl.nasa.gov> wrote: >>>>> >>>>> Hey Folks, >>>>> >>>>> Anyone have a Russian Language Model for Joshua? Lewis was working on >>>>> one, not sure if he has it but just broadening the question. >>>>> >>>>> Cheers, >>>>> Chris >>>>> >>>>> ++ >>>>> Chris Mattmann, Ph.D. >>>>> Chief Architect >>>>> Instrument Software and Science Data Systems Section (398) >>>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>>>> Office: 168-519, Mailstop: 168-527 >>>>> Email: chris.a.mattm...@nasa.gov >>>>> WWW: http://sunset.usc.edu/~mattmann/ >>>>> ++ >>>>> Director, Information Retrieval and Data Science Group (IRDS) >>>>> Adjunct Associate Professor, Computer Science Department >>>>> University of Southern California, Los Angeles, CA 90089 USA >>>>> WWW: http://irds.usc.edu/ >>>>> ++ >>
Re: Russian Language Model for Joshua
Street price is: r3.8xlarge 32 104 244 2 x 320 SSD $2.66 per Hour -- Director Meteorite.bi - Saiku Analytics Founder Tel: +44(0)5603641316 (Thanks to the Saiku community we reached our Kickstart <http://kickstarter.com/projects/2117053714/saiku-reporting-interactive-report-designer/> goal, but you can always help by sponsoring the project <http://www.meteorite.bi/products/saiku/sponsorship>) On 15 July 2016 at 23:32, Mattmann, Chris A (3980) < chris.a.mattm...@jpl.nasa.gov> wrote: > Will do. > > Adding Paul Zimdars - do we have an Amazon machine that has > 256GB > of memory? How much would that cost? > > Cheers, > Chris > > ++ > Chris Mattmann, Ph.D. > Chief Architect > Instrument Software and Science Data Systems Section (398) > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 168-519, Mailstop: 168-527 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++ > Director, Information Retrieval and Data Science Group (IRDS) > Adjunct Associate Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > WWW: http://irds.usc.edu/ > ++ > > > > > > > > > > > On 7/15/16, 1:42 PM, "Matt Post" <p...@cs.jhu.edu> wrote: > > >All right, started trying to recompile. If you have a machine with > 256 > GB of memory, it might be more efficient for me to give you the raw ARPA > file and for you to compile it. We'll see how it goes. Ping me in a day if > you don't hear from me. > > > >matt > > > > > >> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) < > chris.a.mattm...@jpl.nasa.gov> wrote: > >> > >> Yes please! :) > >> > >> Sent from my iPhone > >> > >>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: > >>> > >>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM > compiles of it failed in the past, but I'll try again. I expect it to be > about 8 GB when that's done. Do you want it? > >>> > >>> matt > >>> > >>> > >>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) < > chris.a.mattm...@jpl.nasa.gov> wrote: > >>>> > >>>> Hey Folks, > >>>> > >>>> Anyone have a Russian Language Model for Joshua? Lewis was working on > >>>> one, not sure if he has it but just broadening the question. > >>>> > >>>> Cheers, > >>>> Chris > >>>> > >>>> ++ > >>>> Chris Mattmann, Ph.D. > >>>> Chief Architect > >>>> Instrument Software and Science Data Systems Section (398) > >>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > >>>> Office: 168-519, Mailstop: 168-527 > >>>> Email: chris.a.mattm...@nasa.gov > >>>> WWW: http://sunset.usc.edu/~mattmann/ > >>>> ++ > >>>> Director, Information Retrieval and Data Science Group (IRDS) > >>>> Adjunct Associate Professor, Computer Science Department > >>>> University of Southern California, Los Angeles, CA 90089 USA > >>>> WWW: http://irds.usc.edu/ > >>>> ++ > >>> > > >
Re: Russian Language Model for Joshua
Will do. Adding Paul Zimdars - do we have an Amazon machine that has > 256GB of memory? How much would that cost? Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Director, Information Retrieval and Data Science Group (IRDS) Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA WWW: http://irds.usc.edu/ ++ On 7/15/16, 1:42 PM, "Matt Post" <p...@cs.jhu.edu> wrote: >All right, started trying to recompile. If you have a machine with > 256 GB of >memory, it might be more efficient for me to give you the raw ARPA file and >for you to compile it. We'll see how it goes. Ping me in a day if you don't >hear from me. > >matt > > >> On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) >> <chris.a.mattm...@jpl.nasa.gov> wrote: >> >> Yes please! :) >> >> Sent from my iPhone >> >>> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: >>> >>> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM >>> compiles of it failed in the past, but I'll try again. I expect it to be >>> about 8 GB when that's done. Do you want it? >>> >>> matt >>> >>> >>>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >>>> <chris.a.mattm...@jpl.nasa.gov> wrote: >>>> >>>> Hey Folks, >>>> >>>> Anyone have a Russian Language Model for Joshua? Lewis was working on >>>> one, not sure if he has it but just broadening the question. >>>> >>>> Cheers, >>>> Chris >>>> >>>> ++ >>>> Chris Mattmann, Ph.D. >>>> Chief Architect >>>> Instrument Software and Science Data Systems Section (398) >>>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>>> Office: 168-519, Mailstop: 168-527 >>>> Email: chris.a.mattm...@nasa.gov >>>> WWW: http://sunset.usc.edu/~mattmann/ >>>> ++ >>>> Director, Information Retrieval and Data Science Group (IRDS) >>>> Adjunct Associate Professor, Computer Science Department >>>> University of Southern California, Los Angeles, CA 90089 USA >>>> WWW: http://irds.usc.edu/ >>>> ++ >>> >
Re: Russian Language Model for Joshua
All right, started trying to recompile. If you have a machine with > 256 GB of memory, it might be more efficient for me to give you the raw ARPA file and for you to compile it. We'll see how it goes. Ping me in a day if you don't hear from me. matt > On Jul 15, 2016, at 4:40 PM, Mattmann, Chris A (3980) > <chris.a.mattm...@jpl.nasa.gov> wrote: > > Yes please! :) > > Sent from my iPhone > >> On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: >> >> I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM compiles >> of it failed in the past, but I'll try again. I expect it to be about 8 GB >> when that's done. Do you want it? >> >> matt >> >> >>> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >>> <chris.a.mattm...@jpl.nasa.gov> wrote: >>> >>> Hey Folks, >>> >>> Anyone have a Russian Language Model for Joshua? Lewis was working on >>> one, not sure if he has it but just broadening the question. >>> >>> Cheers, >>> Chris >>> >>> ++ >>> Chris Mattmann, Ph.D. >>> Chief Architect >>> Instrument Software and Science Data Systems Section (398) >>> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >>> Office: 168-519, Mailstop: 168-527 >>> Email: chris.a.mattm...@nasa.gov >>> WWW: http://sunset.usc.edu/~mattmann/ >>> ++ >>> Director, Information Retrieval and Data Science Group (IRDS) >>> Adjunct Associate Professor, Computer Science Department >>> University of Southern California, Los Angeles, CA 90089 USA >>> WWW: http://irds.usc.edu/ >>> ++ >>
Re: Russian Language Model for Joshua
Yes please! :) Sent from my iPhone > On Jul 15, 2016, at 1:39 PM, Matt Post <p...@cs.jhu.edu> wrote: > > I have one built on Common Crawl. It's 25 GB uncompressed. My KenLM compiles > of it failed in the past, but I'll try again. I expect it to be about 8 GB > when that's done. Do you want it? > > matt > > >> On Jul 15, 2016, at 3:50 PM, Mattmann, Chris A (3980) >> <chris.a.mattm...@jpl.nasa.gov> wrote: >> >> Hey Folks, >> >> Anyone have a Russian Language Model for Joshua? Lewis was working on >> one, not sure if he has it but just broadening the question. >> >> Cheers, >> Chris >> >> ++ >> Chris Mattmann, Ph.D. >> Chief Architect >> Instrument Software and Science Data Systems Section (398) >> NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA >> Office: 168-519, Mailstop: 168-527 >> Email: chris.a.mattm...@nasa.gov >> WWW: http://sunset.usc.edu/~mattmann/ >> ++ >> Director, Information Retrieval and Data Science Group (IRDS) >> Adjunct Associate Professor, Computer Science Department >> University of Southern California, Los Angeles, CA 90089 USA >> WWW: http://irds.usc.edu/ >> ++ >
Russian Language Model for Joshua
Hey Folks, Anyone have a Russian Language Model for Joshua? Lewis was working on one, not sure if he has it but just broadening the question. Cheers, Chris ++ Chris Mattmann, Ph.D. Chief Architect Instrument Software and Science Data Systems Section (398) NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA Office: 168-519, Mailstop: 168-527 Email: chris.a.mattm...@nasa.gov WWW: http://sunset.usc.edu/~mattmann/ ++ Director, Information Retrieval and Data Science Group (IRDS) Adjunct Associate Professor, Computer Science Department University of Southern California, Los Angeles, CA 90089 USA WWW: http://irds.usc.edu/ ++