Re: [Discuss] Serverless scientific computing (function as a service)
I have anecdotes rather than data, but around here it's getting to be more popular as more people try it out. Folks do need to take care to look through their grant terms to see if there's any specific language about region restrictions or cloud computing restrictions. (I have a friend who keeps burning through SSDs on an under-her-desk server who would love to have Amazon's resources available but her grant specifically prohibits cloud computing.) One local example: Here at the University of Illinois we had a team spend 3 months analyzing a set of data, only to discover at the end of the 3 months that there was a software version mismatch that made that run incompatible with the rest of their results. They came to talk to my department's cloud and virtualization team about what could be done, and discovered that there was a way to re-run their work in 3 days at a really cost effective price point on Amazon Web Services with our team's help. (Our lead thinks he might be able to get that originally 3 month process down into 1 day with a little more optimization.) Our AWS team often serves as an intermediary to help guide researchers through navigating the networking, security, and optimization issues. -Dena Strong, Technology Services, University of Illinois -Original Message- From: Discuss [mailto:discuss-boun...@lists.software-carpentry.org] On Behalf Of Peter Steinbach Sent: Tuesday, June 13, 2017 6:38 AM To: discuss@lists.software-carpentry.org Subject: Re: [Discuss] Serverless scientific computing (function as a service) Dear both, as a side note (and my apologies for digressing), I was wondering how popular cloud computing for data processing at scale in an academic context is in the US or elsewhere? Here in Europe, many universities run their own HPC centers where people can sign up to process larger amounts of data or do larger simulations or whatnot ... mostly people here are concerned about efficiency (data connnections into the cloud are typically poor, VM overhead is considerable) and security/confidentiality when putting scientific workflows into the cloud. What is your take on this? Best, Peter PS. I love the "serverless" metaphor. Get's rid of all the problems of computers. ;) On 06/12/2017 06:02 PM, Marianne Corvellec wrote: > Hi Justin, > > Thank you so much for the quick reply! > > I'm going to give this new package a try. > > Best, > Marianne > > On Fri, Jun 9, 2017 at 11:20 AM, Justin Kitzeswrote: >> Hi Marianne, >> >> PyWren by Eric Jonas sounds like it's pretty similar to what you're >> looking for - >> >> http://pywren.io/ >> >> It's a relatively new package that's still in active development, but Eric >> is very interested in expanding it (and has some support from the riselab at >> UC Berkeley to do so). I know that he's also actively looking for use cases, >> so I'd definitely suggest getting in touch with him if you're interested. >> >> Best, >> >> Justin >> >> -- >> Justin Kitzes >> Energy and Resources Group >> Berkeley Institute for Data Science >> University of California, Berkeley >> >>> On Jun 9, 2017, at 6:51 AM, Marianne Corvellec >>> wrote: >>> >>> Dear community, >>> >>> I'm curious as to whether some of you might have worked on or used >>> solutions such as AWS Lambda in the context of your scientific >>> research. >>> >>> If so, have you documented it in a blog post that you could share? >>> Thanks in advance! >>> >>> Without even considering workflows or full-fledged projects, >>> wouldn't we want to be able to make a standard API call to, say, fit >>> a polynomial to some data? Is anyone aware of any effort in this >>> direction? >>> >>> A friend of mine just drew my attention to this general issue, which >>> touches on open science and reproducible research... In the >>> meantime, I'll encourage him to join this mailing list! >>> >>> Thank you, >>> Marianne >>> ___ >>> Discuss mailing list >>> Discuss@lists.software-carpentry.org >>> http://lists.software-carpentry.org/listinfo/discuss >> > ___ > Discuss mailing list > Discuss@lists.software-carpentry.org > http://lists.software-carpentry.org/listinfo/discuss > ___ Discuss mailing list Discuss@lists.software-carpentry.org http://lists.software-carpentry.org/listinfo/discuss ___ Discuss mailing list Discuss@lists.software-carpentry.org http://lists.software-carpentry.org/listinfo/discuss
Re: [Discuss] Serverless scientific computing (function as a service)
Hi Peter, I wouldnt be able to use such services with clinical data. It's totally not an option for me. Although I've seen some talks and the performance seems quite competitive since scalability is easy. It's true that uploading a big quantity of data can take a considerable time and bandwith, some labs use the weekends for data uploading. One problem may be to convince University fund managers to pay for external computing services when they already provide HPC services. My five cents... On Tue, 13 Jun 2017, 13:38 Peter Steinbach,wrote: > Dear both, > > as a side note (and my apologies for digressing), I was wondering how > popular cloud computing for data processing at scale in an academic > context is in the US or elsewhere? > > Here in Europe, many universities run their own HPC centers where people > can sign up to process larger amounts of data or do larger simulations > or whatnot ... mostly people here are concerned about efficiency (data > connnections into the cloud are typically poor, VM overhead is > considerable) and security/confidentiality when putting scientific > workflows into the cloud. > What is your take on this? > > Best, > Peter > > > PS. I love the "serverless" metaphor. Get's rid of all the problems of > computers. ;) > > On 06/12/2017 06:02 PM, Marianne Corvellec wrote: > > Hi Justin, > > > > Thank you so much for the quick reply! > > > > I'm going to give this new package a try. > > > > Best, > > Marianne > > > > On Fri, Jun 9, 2017 at 11:20 AM, Justin Kitzes > wrote: > >> Hi Marianne, > >> > >> PyWren by Eric Jonas sounds like it's pretty similar to what you're > looking for - > >> > >> http://pywren.io/ > >> > >> It's a relatively new package that's still in active development, but > Eric is very interested in expanding it (and has some support from the > riselab at UC Berkeley to do so). I know that he's also actively looking > for use cases, so I'd definitely suggest getting in touch with him if > you're interested. > >> > >> Best, > >> > >> Justin > >> > >> -- > >> Justin Kitzes > >> Energy and Resources Group > >> Berkeley Institute for Data Science > >> University of California, Berkeley > >> > >>> On Jun 9, 2017, at 6:51 AM, Marianne Corvellec < > marianne.corvel...@gmail.com> wrote: > >>> > >>> Dear community, > >>> > >>> I'm curious as to whether some of you might have worked on or used > >>> solutions such as AWS Lambda in the context of your scientific > >>> research. > >>> > >>> If so, have you documented it in a blog post that you could share? > >>> Thanks in advance! > >>> > >>> Without even considering workflows or full-fledged projects, wouldn't > >>> we want to be able to make a standard API call to, say, fit a > >>> polynomial to some data? Is anyone aware of any effort in this > >>> direction? > >>> > >>> A friend of mine just drew my attention to this general issue, which > >>> touches on open science and reproducible research... In the meantime, > >>> I'll encourage him to join this mailing list! > >>> > >>> Thank you, > >>> Marianne > >>> ___ > >>> Discuss mailing list > >>> Discuss@lists.software-carpentry.org > >>> http://lists.software-carpentry.org/listinfo/discuss > >> > > ___ > > Discuss mailing list > > Discuss@lists.software-carpentry.org > > http://lists.software-carpentry.org/listinfo/discuss > > > ___ > Discuss mailing list > Discuss@lists.software-carpentry.org > http://lists.software-carpentry.org/listinfo/discuss -- Sent from my phone, sorry for brevity or typos. ___ Discuss mailing list Discuss@lists.software-carpentry.org http://lists.software-carpentry.org/listinfo/discuss
Re: [Discuss] Serverless scientific computing (function as a service)
Dear both, as a side note (and my apologies for digressing), I was wondering how popular cloud computing for data processing at scale in an academic context is in the US or elsewhere? Here in Europe, many universities run their own HPC centers where people can sign up to process larger amounts of data or do larger simulations or whatnot ... mostly people here are concerned about efficiency (data connnections into the cloud are typically poor, VM overhead is considerable) and security/confidentiality when putting scientific workflows into the cloud. What is your take on this? Best, Peter PS. I love the "serverless" metaphor. Get's rid of all the problems of computers. ;) On 06/12/2017 06:02 PM, Marianne Corvellec wrote: Hi Justin, Thank you so much for the quick reply! I'm going to give this new package a try. Best, Marianne On Fri, Jun 9, 2017 at 11:20 AM, Justin Kitzeswrote: Hi Marianne, PyWren by Eric Jonas sounds like it's pretty similar to what you're looking for - http://pywren.io/ It's a relatively new package that's still in active development, but Eric is very interested in expanding it (and has some support from the riselab at UC Berkeley to do so). I know that he's also actively looking for use cases, so I'd definitely suggest getting in touch with him if you're interested. Best, Justin -- Justin Kitzes Energy and Resources Group Berkeley Institute for Data Science University of California, Berkeley On Jun 9, 2017, at 6:51 AM, Marianne Corvellec wrote: Dear community, I'm curious as to whether some of you might have worked on or used solutions such as AWS Lambda in the context of your scientific research. If so, have you documented it in a blog post that you could share? Thanks in advance! Without even considering workflows or full-fledged projects, wouldn't we want to be able to make a standard API call to, say, fit a polynomial to some data? Is anyone aware of any effort in this direction? A friend of mine just drew my attention to this general issue, which touches on open science and reproducible research... In the meantime, I'll encourage him to join this mailing list! Thank you, Marianne ___ Discuss mailing list Discuss@lists.software-carpentry.org http://lists.software-carpentry.org/listinfo/discuss ___ Discuss mailing list Discuss@lists.software-carpentry.org http://lists.software-carpentry.org/listinfo/discuss ___ Discuss mailing list Discuss@lists.software-carpentry.org http://lists.software-carpentry.org/listinfo/discuss