Re: Organizing modules and their code

dn via Python-list Fri, 03 Feb 2023 21:27:33 -0800

On 04/02/2023 16.24, Thomas Passin wrote:

On 2/3/2023 5:14 PM, 2qdxy4rzwzuui...@potatochowder.com wrote:
Keep It Simple:  Put all four modules at the top level, and run with it
until you falsify it.  Yes, I would give you that same advice no matter
what language you're using.
In my recent message I supported DESIGN 1. But I really don't care muchabout the directory organization. It's designing modules whose businessis to handle various kinds of operations that counts, not so much theactual directory organization.


+1 (and to comments made in preceding post)

With ETL the 'reasons to change' (SRP) come from different 'actors'. Forexample, the data-source may be altered either in format or by changingthe tool you'll utilise to access. Accordingly, the virtue of keeping itseparate from other parts. If you have multiple data-sources, then eachshould be separate for the same reason.

The transform is likely dictated by your client's specification. So,another separation. Hence Design 1.

There is a strong argument for suggesting that we're going out of ourway to imagine problems or future-changes (which may never happen). Ifthis is (definitely?) a one-off, then why-bother? If permanence islikely, (so many 'temporary' solutions end-up lasting years!) thenre-use can?should be considered.

Thus, when it comes to loading the data into your own DB; perhaps thisshould be separate, because it is highly likely that the mechanisms youbuild for loading will be matched by at least one 'someone else' wantingto access the same data for the desired end-purposes. Accordingly, ashareable module and/or class for that.

We can't see the code-structure, so some of the other parts of yourquestion(s) are too broad. Here's hoping you and Liskov have a good timetogether...

My preference is for (what I term) the 'circles' diagram (see copy athttps://mahu.rangi.cloud/CraftingSoftware/CleanArchitecture.jpg). Thisillustrates the 'rule' that code handling the inner functionality notknow what happens at the more detailed/lower-level functional level ofthe outer rings.

With ETL, there's precious little to embody various circles, but thecontent of the outer ring is obvious. The "T" rules comprise the inner"Use Case", even if you eschew "Entities" insofar as OOP-avoidance isconcerned. This 'inversion', where the inner controls don't need to careabout the details of outer-ring implementation (is it an RDBMS, MySQL orPostgres; or is it some NoSQL system?) brings to life the "D" of SOLID,ie Dependency Inversion.

You may pick-up some ideas or reassurance from "Making a Simple DataPipeline Part 1: The ETL Pattern"(https://www.codeproject.com/Articles/5324207/Making-a-Simple-Data-Pipeline-Part-1-The-ETL-Patte).


Let us know how it turns-out...
--
Regards,
=dn
--
https://mail.python.org/mailman/listinfo/python-list

Re: Organizing modules and their code

Reply via email to