Thanks so much, Nitesh!

That works beautifully!

Best regards,

Julie

 
On 3/27/18, 4:19 PM, "Turaga, Nitesh" <nitesh.tur...@roswellpark.org> wrote:

    Hi Julie,
    
    I used a tool called BFG Repo-Cleaner to fix your repository.
    
    The problem was two very large files:
    
        Filename              Git id     Size
        -------------------------------------------
        HEK293_site4All.bam | 4643c11b | 87.9 MB
        HEK293_site4All.bed | ba288886 | 124.1 MB
    
    BFG Repo-Cleaner removes those large files and cleans your git history of any traces of them. These files were in your older RELEASE versions.
    
    You should be able to work on the repository now. You will need to get a fresh clone of it.
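
For readers who have not used the tool, a cleanup of this kind might look roughly like the sketch below. It is not the exact procedure used here: the function name strip_large_blobs, the jar path, and the 50M threshold are all assumptions for illustration. The BFG option and the follow-up git commands are the ones described in the BFG Repo-Cleaner usage notes.

```shell
# Sketch only: strip every blob above a size threshold from a repository's
# history with BFG Repo-Cleaner. The function name, jar location and the
# default 50M threshold are illustrative assumptions, not from the thread.
strip_large_blobs() {
    repo_url=$1            # repository to clean
    bfg_jar=$2             # path to a downloaded BFG jar
    threshold=${3:-50M}    # blobs larger than this are removed

    if [ ! -f "$bfg_jar" ]; then
        echo "BFG jar not found: $bfg_jar" >&2
        return 1
    fi

    workdir=$(mktemp -d) || return 1

    # Work on a bare mirror clone so every branch and tag gets rewritten.
    git clone --mirror "$repo_url" "$workdir/repo.git" || return 1

    # Rewrite history, dropping blobs above the threshold.
    java -jar "$bfg_jar" --strip-blobs-bigger-than "$threshold" \
        "$workdir/repo.git" || return 1

    # The old blobs stay on disk until the reflog is expired and gc runs.
    (cd "$workdir/repo.git" &&
        git reflog expire --expire=now --all &&
        git gc --prune=now --aggressive) || return 1

    echo "Cleaned mirror is in $workdir/repo.git"
    # Publishing rewrites shared history, so it is left as a manual step:
    #   (cd "$workdir/repo.git" && git push)
}
```

Because every commit that touched the removed files gets a new id, existing clones no longer match the cleaned repository, which is why a fresh clone is needed afterwards.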
    
    Best,
    
    Nitesh
    
    
    > On Mar 26, 2018, at 4:03 PM, Zhu, Lihua (Julie) <julie....@umassmed.edu> 
wrote:
    > 
    > Thanks so much, Nitesh!
    > 
    > FYI, I sent an email to bioc-devel (cc'ing Martin) as well. Hope you saw it. It would be great if you could send your solution in reply to my email on bioc-devel so that others can benefit.
    > 
    > Best regards,
    > 
    > Julie
    > 
    > On 3/26/18, 3:39 PM, "Turaga, Nitesh" <nitesh.tur...@roswellpark.org> 
wrote:
    > 
    >    Hi Julie,
    > 
    > 
    > 
    >    Please send this email to the bioc-devel mailing list. It’s a very valid question and I think everyone in the community should benefit from it. I’ll take a look at your problem now.
    > 
    > 
    > 
    >           bioc-devel <bioc-devel@r-project.org>
    > 
    > 
    > 
    >    Best,
    > 
    > 
    > 
    >    Nitesh 
    > 
    >> On Mar 26, 2018, at 3:36 PM, Zhu, Lihua (Julie) <julie....@umassmed.edu> 
wrote:
    > 
    >> 
    > 
    >> Nitesh,
    > 
    >> 
    > 
    >> A while ago, Jim suggested replacing two big data files with two smaller ones, which I did.
    > 
    >> 
    > 
    >> Oddly, when I tried to import the GUIDEseq package to GitHub today, the push still reported these two files as too big, although I do not see them in my local repository, which was checked out from the Bioconductor repository. I think the files in the older branches affect git push origin master.
    > 
    >> 
    > 
    >> Could you please help? Thanks!
    > 
    >> 
    > 
    >> Best regards,
    > 
    >> 
    > 
    >> Julie
    > 
    >> 
    > 
    >> 
    > 
    >> FYI, I am following the steps outlined at https://bioconductor.org/developers/how-to/git/maintain-github-bioc/
    > 
    >> 
    > 
    >> JulieZhuMac2017:GUIDEseq ZHUJ$ git push origin master
    >> Counting objects: 515, done.
    >> Delta compression using up to 8 threads.
    >> Compressing objects: 100% (180/180), done.
    >> Writing objects: 100% (515/515), 107.80 MiB | 18.38 MiB/s, done.
    >> Total 515 (delta 344), reused 483 (delta 320)
    >> remote: Resolving deltas: 100% (344/344), done.
    >> remote: warning: File inst/extdata/HEK293_site4All.bam is 87.88 MB; this is larger than GitHub's recommended maximum file size of 50.00 MB
    >> remote: error: GH001: Large files detected. You may want to try Git Large File Storage - https://git-lfs.github.com.
    >> remote: error: Trace: faafca1a45d562b62039862ac0dfbf85
    >> remote: error: See http://git.io/iEPt8g for more information.
    >> remote: error: File inst/extdata/HEK293_site4All.bed is 124.11 MB; this exceeds GitHub's file size limit of 100.00 MB
    >> To https://github.com/LihuaJulieZhu/GUIDEseq.git
    >> ! [remote rejected] master -> master (pre-receive hook declined)
    >> error: failed to push some refs to 'https://github.com/LihuaJulieZhu/GUIDEseq.git'
    >> 
    >> JulieZhuMac2017:GUIDEseq ZHUJ$ git rm -r inst/extdata/HEK293_site4All.bed
    >> fatal: pathspec 'inst/extdata/HEK293_site4All.bed' did not match any files
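
The git rm above fails because the file is no longer in the working tree; it survives only in older commits, and those commits are still sent by git push. One way to confirm this is to list every blob reachable from any ref together with its size. The sketch below uses plain git plumbing; the function name list_large_blobs and the default 50000000-byte cutoff are assumptions chosen for illustration.

```shell
# list_large_blobs [min_bytes]
# Print "size path" for every blob of at least min_bytes that is reachable
# from any branch or tag, largest first. Run it inside a git repository.
# The function name and the default cutoff are illustrative assumptions.
list_large_blobs() {
    min_bytes=${1:-50000000}
    # rev-list emits "<id> <path>" for each object; cat-file adds type and
    # size and passes the path through unchanged via %(rest).
    git rev-list --objects --all |
        git cat-file --batch-check='%(objecttype) %(objectsize) %(rest)' |
        awk -v min="$min_bytes" '
            $1 == "blob" && $2 + 0 >= min + 0 {
                size = $2
                sub(/^blob [0-9]+ /, "")   # keep the full path, spaces included
                print size, $0
            }' |
        sort -rn
}
```

Running list_large_blobs in a clone of the package would be expected to show any oversized files that still exist somewhere in history, even when they are absent from the current checkout.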
    > 
    >> 
    > 
    >> 
    > 
    >> From: "Zhu, Lihua (Julie)" <julie....@umassmed.edu>
    > 
    >> Date: Monday, October 19, 2015 at 1:32 PM
    > 
    >> To: Michael Lawrence <lawrence.mich...@gene.com>, Jim Hester 
<james.hes...@bioconductor.org>
    > 
    >> Subject: Re: Data files in GUIDEseq
    > 
    >> 
    > 
    >> Jim, FYI, I have replaced the two large datasets with much smaller ones. Thanks for your feedback!
    > 
    >> 
    > 
    >> Michael, please feel free to check out the new dataset in the following 
commit. Thanks!
    > 
    >> 
    > 
    >> Best regards,
    > 
    >> 
    > 
    >> Julie
    > 
    >> 
    > 
    >> 
    > 
    >> 
    > 
    >> Julie-Zhus-MacBook-Pro-Intel-Core-i7:GUIDEseq zhuj$ svn ci -m "changed to smaller test datasets"
    >> Sending        DESCRIPTION
    >> Deleting       inst/extdata/HEK293_site4All.bam
    >> Deleting       inst/extdata/HEK293_site4All.bed
    >> Adding  (bin)  inst/extdata/bowtie2.HEK293_site4_chr13.sort.bam
    >> Adding         inst/extdata/bowtie2.HEK293_site4_chr13.sort.bed
    >> Transmitting file data ...
    >> Committed revision 109732.
    > 
    >> 
    > 
    >> From: Lihua Julie Zhu <julie....@umassmed.edu>
    > 
    >> Date: Thursday, October 15, 2015 2:16 PM
    > 
    >> To: Michael Lawrence <lawrence.mich...@gene.com>, Jim Hester 
<james.hes...@bioconductor.org>
    > 
    >> Subject: Re: Data files in GUIDEseq
    > 
    >> 
    > 
    >> Yes, we need to fetch a few enriched regions.
    > 
    >> 
    > 
    >> 
    > 
    >> From: Michael Lawrence <lawrence.mich...@gene.com>
    > 
    >> Date: Thursday, October 15, 2015 2:13 PM
    > 
    >> To: Jim Hester <james.hes...@bioconductor.org>
    > 
    >> Cc: Michael Lawrence <lawrence.mich...@gene.com>, Lihua Julie Zhu 
<julie....@umassmed.edu>
    > 
    >> Subject: Re: Data files in GUIDEseq
    > 
    >> 
    > 
    >> Since this is peak calling, we probably want to keep all the reads 
within a smaller region.
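
Keeping all reads within one region, rather than sampling uniformly, could be sketched with samtools as below. This assumes a coordinate-sorted BAM and is not necessarily how the replacement files in this thread were produced. The function name subset_bam_region is hypothetical, and chr13 in the usage line is only a guess taken from the name of the replacement file (bowtie2.HEK293_site4_chr13.sort.bam).

```shell
# subset_bam_region IN.bam REGION OUT.bam
# Write only the reads overlapping REGION to OUT.bam.
# The function name is a hypothetical helper for illustration.
subset_bam_region() {
    in_bam=$1
    region=$2
    out_bam=$3

    if [ ! -f "$in_bam" ]; then
        echo "input BAM not found: $in_bam" >&2
        return 1
    fi
    if ! command -v samtools >/dev/null 2>&1; then
        echo "samtools not found on PATH" >&2
        return 1
    fi

    # Region queries need an index, which needs a coordinate-sorted BAM.
    samtools index "$in_bam" || return 1

    # -b keeps BAM output; the region is given after the input file.
    samtools view -b -o "$out_bam" "$in_bam" "$region"
}

# Example (file and region names are assumptions):
#   subset_bam_region HEK293_site4All.bam chr13 HEK293_site4_chr13.bam
```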
    > 
    >> 
    > 
    >> On Thu, Oct 15, 2015 at 11:12 AM, Jim Hester 
<james.hes...@bioconductor.org> wrote:
    > 
    >> FWIW the hard limit that was causing the error with the git mirrors is 100 MB, but I would try to shoot for a file of around 20 MB at most.
    > 
    >> 
    > 
    >> An easy way if you just want uniform sampling is using samtools (where -s
    > 
    >> is the fraction of reads you want to keep).
    > 
    >> 
    > 
    >> samtools view  HEK293_site4All.bam -s .1 -b -o HEK293_site4All2.bam
    > 
    >> 
    > 
    >> Thanks,
    > 
    >> 
    > 
    >> Jim
    > 
    >> 
    > 
    >> 
    > 
    >> On Thu, Oct 15, 2015 at 2:07 PM, Michael Lawrence <lawrence.mich...@gene.com> wrote:
    > 
    >> 
    > 
    >>> Probably best to come up with smaller files, because we want them in the
    > 
    >>> package for demonstration purposes, right? Just need to filter them,
    > 
    >>> somehow. Actually, all we need is the BAM file...
    > 
    >>> 
    > 
    >>> On Thu, Oct 15, 2015 at 10:43 AM, Zhu, Lihua (Julie) <
    > 
    >>> julie....@umassmed.edu> wrote:
    > 
    >>> 
    > 
    >>>> Thanks, Jim!
    > 
    >>>> 
    > 
    >>>> These files were included for Michael to test functions he is 
developing
    > 
    >> 
    > 
    >> 
    > 
    >> Lihua Julie Zhu, Ph.D.
    > 
    >> Professor
    > 
    >> Molecular, Cell and Cancer Biology (MCCB)
    > 
    >> Head of Bioinformatics Core, MCCB
    > 
    >> Program in Bioinformatics and Integrative Biology
    > 
    >> Program in Molecular Medicine
    > 
    >> University of Massachusetts Medical School
    > 
    >> 364 Plantation Street
    > 
    >> Worcester, MA
    > 
    > 
    > 
    > 
    > 
    >    This email message may contain legally privileged and/or confidential 
information.  If you are not the intended recipient(s), or the employee or 
agent responsible for the delivery of this message to the intended 
recipient(s), you are hereby notified that any disclosure, copying, 
distribution, or use of this email message is prohibited.  If you have received 
this message in error, please notify the sender immediately by e-mail and 
delete this email message from your computer. Thank you.
    
    
    

_______________________________________________
Bioc-devel@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/bioc-devel
