[ccp4bb] Secondary structure prediction for billions of sequences ?

2020-02-19 Thread hari jayaram
Hi all
It’s been a while since I posted on this group so apologies for a slightly
tangential, non CCP4 question.

I wanted to get secondary structure predictions for designing a library of
50-100 amino acid peptides. The library could get very large ( 10^9) and I
was wondering if there is a way to predict the secondary structure for this
large set of sequences that scales well.

I am currently using psipred which works well, but runs one sequences at a
time and generates the desired prediction with concurrent creation of many
temp files.

Although I realize this is solvable with the current approach, by
parallelizing the calls  I was wondering if there are other approaches that
are suitable for this.

Thanks for your help in advance
Hari Jayaram



To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/webadmin?SUBED1=CCP4BB&A=1


Re: [ccp4bb] secondary structure prediction

2017-12-15 Thread Joana Pereira
Dear Zheng,

There are some things you can try, which depend on the computational resources 
you have available and the time you want to wait. The best would be to try 
Rosetta ab initio modelling. Preferably, you would run Rosetta using your local 
installation but if you don’t have Rosetta installed, this can be annoying to 
do. You can also use the Rosie web server (the Rosetta web server), but it 
takes long time to compute. Other faster solutions for "ab initio" modelling 
would be Quark (in case your protein is short), I-tasser or RaptorX Contact 
Prediction web servers. I am very happy with the latter, I use it frequently 
when I don’t want to wait for Rosetta. You can also try homology modelling, but 
this would be less reliable when the few motifs you find do not overlap and do 
not cover the entire protein. Anyway, for homology modelling you can try , for 
example, Modeller after HHPred (you can do this using the MPI Bioinformatics 
toolkit) or RaptorX Structure Prediction.

I hope this helps you.

Best regards and good luck!

Dr. Joana Pereira
--
Postdoctoral Researcher
Department of Protein Evolution

Max Planck Institute for Developmental Biology
Spemannstraße 35
72076 Tübingen
GERMANY


> On 06 Dec 2017, at 15:14, zheng zhou  wrote:
> 
> Dear CCP4 community,
> 
> Sorry for the off-topic question. I am trying to design constructs for
> structure studies. It only has a homolog structure in PDB with
> sequence identity ~20%. When I blast against PDB sequence, there are
> quite a few motif hits (30~40aa, identity 40~50%). Any prediction
> tools utilize this information?
> 
> Thanks for your advice in advance.
> 
> Best,
> 
> Zheng




Re: [ccp4bb] secondary structure prediction

2017-12-07 Thread Phoebe A. Rice
I'm fond of RaptorX


++

Phoebe A. Rice
Dept. of Biochemistry & Molecular Biology
The University of Chicago

pr...@uchicago.edu<mailto:pr...@uchicago.edu>
https://voices.uchicago.edu/phoebericelab/


From: CCP4 bulletin board [CCP4BB@JISCMAIL.AC.UK] on behalf of Missoury Sophia 
[sophia.misso...@u-psud.fr]
Sent: Wednesday, December 06, 2017 9:20 AM
To: CCP4BB@JISCMAIL.AC.UK
Subject: Re: [ccp4bb] secondary structure prediction

Modeller


Sophia Missoury





De: "Smith Liu" 
À: CCP4BB@JISCMAIL.AC.UK
Envoyé: Mercredi 6 Décembre 2017 16:04:12
Objet: Re: [ccp4bb] secondary structure prediction

Rosetta



<https://maas.mail.163.com/dashi-web-extend/html/proSignature.html?iconUrl=https://nos.netease.com/mail-online/qiyelogo/defaultAvatar.png&name=Smith%20Liu&uid=smith_liu123%40163.com&ftlId=1&items=%5B%22%E9%82%AE%E7%AE%B1%EF%BC%9Asmith_liu123%40163.com%22%5D>
[https://nos.netease.com/mail-online/qiyelogo/defaultAvatar.png]
Smith Liu

邮箱:smith_liu...@163.com


签名由 网易邮箱大师<https://mail.163.com/dashi/dlpro.html?from=mail88> 定制


在2017年12月06日 22:14,zheng zhou<mailto:zhengzho...@gmail.com> 写道:
Dear CCP4 community,

Sorry for the off-topic question. I am trying to design constructs for
structure studies. It only has a homolog structure in PDB with
sequence identity ~20%. When I blast against PDB sequence, there are
quite a few motif hits (30~40aa, identity 40~50%). Any prediction
tools utilize this information?

Thanks for your advice in advance.

Best,

Zheng



Re: [ccp4bb] secondary structure prediction

2017-12-07 Thread Smith Liu
yes




| |
Smith Liu
|
|
邮箱:smith_liu...@163.com
|

签名由 网易邮箱大师 定制




在2017年12月07日 09:11,zheng zhou 写道:
Thanks for many advices. I was not clear in the previous email. I know
the close homologous protein (20% identity, total 500aa), but the
fragment hits (30~40aa, identity 40~50%) are from other proteins in
PDB. I am trying to see whether the fragments from non-homologous
proteins may help the secondary structure prediction.

Best,
Z

On Wed, Dec 6, 2017 at 10:14 PM, zheng zhou  wrote:
> Dear CCP4 community,
>
> Sorry for the off-topic question. I am trying to design constructs for
> structure studies. It only has a homolog structure in PDB with
> sequence identity ~20%. When I blast against PDB sequence, there are
> quite a few motif hits (30~40aa, identity 40~50%). Any prediction
> tools utilize this information?
>
> Thanks for your advice in advance.
>
> Best,
>
> Zheng


Re: [ccp4bb] secondary structure prediction

2017-12-06 Thread zheng zhou
Thanks for many advices. I was not clear in the previous email. I know
the close homologous protein (20% identity, total 500aa), but the
fragment hits (30~40aa, identity 40~50%) are from other proteins in
PDB. I am trying to see whether the fragments from non-homologous
proteins may help the secondary structure prediction.

Best,
Z

On Wed, Dec 6, 2017 at 10:14 PM, zheng zhou  wrote:
> Dear CCP4 community,
>
> Sorry for the off-topic question. I am trying to design constructs for
> structure studies. It only has a homolog structure in PDB with
> sequence identity ~20%. When I blast against PDB sequence, there are
> quite a few motif hits (30~40aa, identity 40~50%). Any prediction
> tools utilize this information?
>
> Thanks for your advice in advance.
>
> Best,
>
> Zheng


Re: [ccp4bb] secondary structure prediction

2017-12-06 Thread Missoury Sophia
Modeller 


Sophia Missoury 



- Mail original -

De: "Smith Liu"  
À: CCP4BB@JISCMAIL.AC.UK 
Envoyé: Mercredi 6 Décembre 2017 16:04:12 
Objet: Re: [ccp4bb] secondary structure prediction 

Rosetta 





Smith Liu 


邮箱:smith_liu...@163.com 


签名由 网易邮箱大师 定制 




在 2017年12月06日 22:14 , zheng zhou 写道: 
Dear CCP4 community, 

Sorry for the off-topic question. I am trying to design constructs for 
structure studies. It only has a homolog structure in PDB with 
sequence identity ~20%. When I blast against PDB sequence, there are 
quite a few motif hits (30~40aa, identity 40~50%). Any prediction 
tools utilize this information? 

Thanks for your advice in advance. 

Best, 

Zheng 






Re: [ccp4bb] secondary structure prediction

2017-12-06 Thread Gert Vriend

I would try your local bioinformatician (or a remote one if there isn't
any one local).

With the sequence in the mail, I could have given it a shot...

Gert


On 12/6/2017 3:14 PM, zheng zhou wrote:

Dear CCP4 community,

Sorry for the off-topic question. I am trying to design constructs for
structure studies. It only has a homolog structure in PDB with
sequence identity ~20%. When I blast against PDB sequence, there are
quite a few motif hits (30~40aa, identity 40~50%). Any prediction
tools utilize this information?

Thanks for your advice in advance.

Best,

Zheng


Het Radboudumc staat geregistreerd bij de Kamer van Koophandel in het 
handelsregister onder nummer 41055629.
The Radboud university medical center is listed in the Commercial Register of 
the Chamber of Commerce under file number 41055629.


Re: [ccp4bb] secondary structure prediction

2017-12-06 Thread Anastassis Perrakis
Dear Zheng,

You might want to try the tools at:

https://ccd.rhpc.nki.nl

Tassos

On Dec 6, 2017, at 15:14, zheng zhou 
mailto:zhengzho...@gmail.com>> wrote:

Dear CCP4 community,

Sorry for the off-topic question. I am trying to design constructs for
structure studies. It only has a homolog structure in PDB with
sequence identity ~20%. When I blast against PDB sequence, there are
quite a few motif hits (30~40aa, identity 40~50%). Any prediction
tools utilize this information?

Thanks for your advice in advance.

Best,

Zheng


Re: [ccp4bb] secondary structure prediction

2017-12-06 Thread Smith Liu
Rosetta




| |
Smith Liu
|
|
邮箱:smith_liu...@163.com
|

签名由 网易邮箱大师 定制




在2017年12月06日 22:14,zheng zhou 写道:
Dear CCP4 community,

Sorry for the off-topic question. I am trying to design constructs for
structure studies. It only has a homolog structure in PDB with
sequence identity ~20%. When I blast against PDB sequence, there are
quite a few motif hits (30~40aa, identity 40~50%). Any prediction
tools utilize this information?

Thanks for your advice in advance.

Best,

Zheng


[ccp4bb] secondary structure prediction

2017-12-06 Thread zheng zhou
Dear CCP4 community,

Sorry for the off-topic question. I am trying to design constructs for
structure studies. It only has a homolog structure in PDB with
sequence identity ~20%. When I blast against PDB sequence, there are
quite a few motif hits (30~40aa, identity 40~50%). Any prediction
tools utilize this information?

Thanks for your advice in advance.

Best,

Zheng