Dear colleagues,

I am currently working on the issue https://pulp.plan.io/issues/6121. It
was reported that syncs of very large repositories take a huge amount of
time compared to Pulp 2.

I profiled the code and realized that that we are repeatedly fetching data
from the database in a loop and manually excluding units which should not
be added to a repository mirrored by Pulp.

Today, I submitted a PR (https://github.com/pulp/pulpcore/pull/565) that
may resolve this issue. Instead of the aforesaid loop, there is now used a
single database call which does the same thing, I suppose. Please, focus on
the last note https://pulp.plan.io/issues/6121#note-11 to better understand
my findings. Also, do not hesitate to review the submitted PR.

Thank you!
_______________________________________________
Pulp-dev mailing list
Pulp-dev@redhat.com
https://www.redhat.com/mailman/listinfo/pulp-dev

Reply via email to