Re: [ccp4bb] AW: [ccp4bb] question regarding sequence numbering

John Berrisford Wed, 20 Sep 2017 00:06:07 -0700

Dear Tony

The only requirements we have for numbering is that every residue mustbe unique when using a combination of residue name (to handlemicroheterogeneity), residue number, insertion code and chain ID.

During curation we will try to map your protein sequence to UniProt -please see the following documentation on this process:


https://www.wwpdb.org/documentation/procedure#toc_1

The exact numbering scheme you choose is up to you (especially forexpression tags), however users of your entry may find it difficult touse your entry is you decided to number your protein randomly or withdecreasing residue numbers. We may suggest that you changed thenumbering if you did this.


Our official wording from the above link is:

"The wwPDB encourages deposition of polymer chains with sequentialresidue numbering. For protein chains, the authors are encouraged tofollow the UniProt residue numbering, wherever possible. The use ofnon-sequential residue numbering and insertion codes should be avoidedas far as possible in order to make structures easily interpretable bythe larger scientific community. If the coordinate residue numbers, asprovided by the author, are unique and sequential within a particularchain ID, the residues will not be renumbered."


this is from the section "How are chain IDs related to residue numbering?"


I hope this helps

John
PDBe


On 19/09/2017 13:51, herman.schreu...@sanofi.com wrote:

Hi Dave and Tony,
Upon submission, the pdb checks the sequence and automaticallygenerates comments about sequences derived from the expression vector.So you do not have to do anything. Given the issues many programs havewith non-sequentially numbered residues, I would also number them 7,8,9.
Best,

Herman
*Von:*CCP4 bulletin board [mailto:CCP4BB@JISCMAIL.AC.UK] *Im Auftragvon *Briggs, David C
*Gesendet:* Dienstag, 19. September 2017 14:24
*An:* CCP4BB@JISCMAIL.AC.UK
*Betreff:* Re: [ccp4bb] question regarding sequence numbering

Hi Tony,
When I've had similar issues, I've numbered them sequentially (i.e.7,8,9) and remarked in the PDB header that they are vector-derivedsequence.
I believe that is what the PDB ask you to do in situations like this(maybe they can comment?).
If they are not numbered sequentially, then often graphics andrefinement software won't treat them as linked.
Dave

--

Dr David C Briggs

Hohenester Lab

Department of Life Sciences

Imperial College London

UK
http://about.me/david_briggs<https://urldefense.proofpoint.com/v2/url?u=http-3A__about.me_david-5Fbriggs&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=CuEDTUtv1fMER1EIW76hQoC60eF1_StruW8oW9VKyFY&e=>
From: Antonio Ariza

Sent: Tuesday 19 September, 13:15

Subject: [ccp4bb] question regarding sequence numbering

To: ccp4bb@jiscmail.ac.uk <mailto:ccp4bb@jiscmail.ac.uk>

Hi all,
Here's a problem I haven't come across before. I'm working on astructure whose expression plasmid was designed to remove the first 9amino acids from the protein of interest and to which an N-terminaltag was added. After cleaving the tag I am left with 3 aminoacids (GPM) followed by the original sequence. Obviously the residuesof interest should follow the numbering of the original sequence (i.e.10, 11, 12, ...). What numbers would you assign to the first 3residues (GPM)? 7, 8, 9? -2, -1, 0?
Cheers,

Tony

------------------------------------------------------

*Dr. Antonio Ariza*

*University of Oxford*

*Sir William Dunn School of Pathology*

*South Parks Road*

*Oxford*

*OX1 3RE*

*e-mail: *antonio.ar...@path.ox.ac.uk <mailto:antonio.ar...@path.ox.ac.uk>

*Tel: 00 +44 1865 285655*

*Links to my public profiles:*
ResearchGate<https://urldefense.proofpoint.com/v2/url?u=https-3A__www.researchgate.net_profile_Antonio-5FAriza&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=JlLk_YBvsa_Pqy9U6uSWCiAB3dyF_ZQR0H_nXk4grZE&e=>
LinkedIn<https://urldefense.proofpoint.com/v2/url?u=https-3A__www.linkedin.com_in_antonioariza1&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=DbSwK3yLqHH92Pr-7NaQyVmzSSScEZ3jt8rNCa9zMbQ&e=>
GoogleScholar<https://urldefense.proofpoint.com/v2/url?u=https-3A__scholar.google.co.uk_citations-3Fuser-3D9pAIKV0AAAAJ-26hl-3Den&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=MmllbCLL0qpk3UuRY8tL6a3rtsHzxmeIXQ5QM7i4rlo&e=>
Twitter<https://urldefense.proofpoint.com/v2/url?u=https-3A__twitter.com_DrAntonioAriza-3Flang-3Den&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=Y-LOBkfQFCKxgiUgTRzNEZXrPFeOzJpt2OOBVMfaQ4Q&e=>
*Check out my latest paper!!!*
Structural insights into the function of<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.nature.com_articles_ncomms15847&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=ajttqwr7ED8_WBG6ALc86GNNTa7qa_WbddCP5AHlUd4&e=>ZRANB3<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.nature.com_articles_ncomms15847&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=ajttqwr7ED8_WBG6ALc86GNNTa7qa_WbddCP5AHlUd4&e=>inreplication stress response<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.nature.com_articles_ncomms15847&d=DwMFAg&c=Dbf9zoswcQ-CRvvI7VX5j3HvibIuT3ZiarcKl5qtMPo&r=HK-CY_tL8CLLA93vdywyu3qI70R4H8oHzZyRHMQu1AQ&m=eXDpcWadyxbrjW5lOO-Vg-tud-0wh7P_EIdo3UxkBjU&s=ajttqwr7ED8_WBG6ALc86GNNTa7qa_WbddCP5AHlUd4&e=>


--
John Berrisford
PDBe
European Bioinformatics Institute (EMBL-EBI)
European Molecular Biology Laboratory
Wellcome Trust Genome Campus
Hinxton
Cambridge CB10 1SD UK
Tel: +44 1223 492529

http://www.pdbe.org
http://www.facebook.com/proteindatabank
http://twitter.com/PDBeurope

Re: [ccp4bb] AW: [ccp4bb] question regarding sequence numbering

Reply via email to