Re: When is logging.getLogger(name) needed?

dn via Python-list Wed, 05 Apr 2023 17:36:29 -0700

Thank you for carefully considering suggestions (and implications) - andwhich will 'work' for you.

Further comment below (and with apologies that, unusually for me, thereare many personal opinions mixed-in):-



On 06/04/2023 01.06, Loris Bennett wrote:

"Loris Bennett" <loris.benn...@fu-berlin.de> writes:

dn <pythonl...@danceswithmice.info> writes:

On 01/04/2023 02.01, Loris Bennett wrote:

Hi,
In my top level program file, main.py, I have
    def main_function():
        parser = argparse.ArgumentParser(description="my prog")
        ...
        args = parser.parse_args()
        config = configparser.ConfigParser()
        if args.config_file is None:
            config_file = DEFAULT_CONFIG_FILE
        else:
            config_file = args.config_file
        config.read(config_file)
        logging.config.fileConfig(fname=config_file)
        logger = logging.getLogger(__name__)
        do_some_stuff()
               my_class_instance = myprog.MyClass()
    def do_some_stuff():
        logger.info("Doing stuff")
This does not work, because 'logger' is not known in the function
'do_some_stuff'.
However, if in 'my_prog/my_class.py' I have
    class MyClass:
        def __init__(self):
            logger.debug("created instance of MyClass")
this 'just works'.
I can add
    logger = logging.getLogger(__name__)
to 'do_some_stuff', but why is this necessary in this case but not
in
the class?
Or should I be doing this entirely differently?


Yes: differently.

To complement @Peter's response, two items for consideration:

1 once main_function() has completed, have it return logger and other
such values/constructs. The target-identifiers on the LHS of the
function-call will thus be within the global scope.

2 if the purposes of main_function() are condensed-down to a few
(English, or ..., language) phrases, the word "and" will feature, eg
- configure env according to cmdLN args,
- establish log(s),
- do_some_stuff(),  ** AND **
- instantiate MyClass.

If these (and do_some_stuff(), like MyClass' methods) were split into
separate functions* might you find it easier to see them as separate
sub-solutions? Each sub-solution would be able to contribute to the
whole - the earlier ones as creating (outputting) a description,
constraint, or basis; which becomes input to a later function/method.


So if I want to modify the logging via the command line I might have the
following:

---------------------------------------------------------------------

#!/usr/bin/env python3

import argparse
import logging


def get_logger(log_level):
     """Get global logger"""

     logger = logging.getLogger('example')
     logger.setLevel(log_level)
     ch = logging.StreamHandler()
     formatter = logging.Formatter('%(levelname)s - %(message)s')
     ch.setFormatter(formatter)
     logger.addHandler(ch)

     return logger


def do_stuff():
     """Do some stuff"""

#    logger.info("Doing stuff!")


Looks like I just need

   logger = logging.getLogger('example)
   logger.info("Doing stuff!")


def main():
     """Main"""

     parser = argparse.ArgumentParser()
     parser.add_argument("--log-level", dest="log_level", type=int)
     args = parser.parse_args()

     print(f"log level: {args.log_level}")

     logger = get_logger(args.log_level)
     logger.debug("Logger!")
     do_stuff()


if __name__ == "__main__":
     main()

---------------------------------------------------------------------

How can I get logging for 'do_stuff' in this case without explicitly
passing 'logger' as an argument or using 'global'?

Somehow I am failing to understand how to get 'logger' defined
sufficiently high up in the program that all references 'lower down' in
the program will be automatically resolved.

At the risk of 'heresy', IMHO the idea of main() is (almost always)unnecessary in Python, and largely a habit carried-over from otherlanguages (need for an entry-/end-point).


NB be sure of the difference between a "script" and a "module"...


My script template-overview:

''' Script docstring. '''
- author, license, etc docs

global constants such as import-s
set environment

if __name__ == "__main__":
    do_this()
    do_that()
    ie the business of the script

Despite its frequent use, I'm somewhat amused by the apparentduplication within:


if __name__ == "__main__":
    main()

ie if this .py file is being executed as a script, call main() - wheremain() is the purpose of the script. Whereas if the file is an import-edmodule, do not execute any of the contained-code.

Thus, the if-statement achieves the same separation as the main()function encapsulation. Also, the word main() conveys considerably lessmeaning than (say) establish_logging().



NB others may care to offer alternate advice for your consideration...

There is a good argument for main(), in that it might enable tests to bewritten which assure the integration of several tasks called by thescript. Plus, if one excludes non-TDD test-able code, such as userinterfaces, even 'fancy headings' and printing of conclusions; there'slittle left. Accordingly, I don't mind the apparent duplication involvedin coding an integration test which checks that do_this() and do_that()are playing-nicely together. YMMV!

I'm not sure why, but my 'mainlines' never seem to be very long. Once Ihad been introduced to "Modular Programming" (1970s?), it seemed thatthe purpose of a mainline was merely calling one do-it function afteranother.

To me mainline's seem highly unlikely to be re-usable. Thus, thepossibility that main() might need to be import-able to some otherscript, is beyond my experience/recollection.

Further, my .py file scripts/mainlines tend to be short and often don'tcontain any of the functions or classes which actually do-the-work - allare import-ed (and thus, they are easier to re-use!?)

Returning to 'set environment': the code-examples include argparse andlogger. Both (in my thinking) are part of creating the environment inwhich the code will execute.

Logging for example, is the only choice when we need to be aware of howa web-based system is running (or running into problems) - otherwise, assome would argue, we might as well use print(). Similarly, argparse willoften influence the way a script executes, the data it should use, etc.

Much of such forms the (global) environment in which (this run of) thecode will execute. Hence locating such setting of priorities andconstraints, adjacent to the import statements.

These must be established before getting on with 'the business'. Thatsaid, if someone prefers to put it under if __main__, I won't burst intotears. (see earlier comment about the .py file as a script cf module)



You have answered your own question about logging. Well done!

The logging instance can either be explicitly passed into each(relevant) function, or it can be treated as a global and thus availableimplicitly. (see also: "Zen of Python")

I prefer your approach of get_logger() - even if that name doesn't quitedescribe the establishment of a logger and its settings. All of thatpart of setting the environment is collected into one place. Whichin-turn, makes it easy to work on any changes, or work-in any parameterswhich may come from other environment-setting activity.

As well as 'the documentation' there is a HowTo(https://docs.python.org/3/howto/logging.html). Other logging-learnershave found the DigitalOcean tutorial helpful(https://www.digitalocean.com/community/tutorials/how-to-use-logging-in-python-3)

* there is some debate amongst developers about whether "one function,
   one purpose" should be a rule, a convention, or tossed in the
  trash. YMMV!

Personal view: SOLID's "Single" principle applies: there should be
only one reason (hanging over the head of each method/function, like
the Sword of Damocles) for it to change - or one 'user' who could
demand a change to that function. In other words, an updated cmdLN
option shouldn't affect a function which establishes logging, for
example.


Web.Refs:
https://people.engr.tamu.edu/choe/choe/courses/20fall/315/lectures/slide23-solid.pdf
https://www.hanselminutes.com/145/solid-principles-with-uncle-bob-robert-c-martin
https://idioms.thefreedictionary.com/sword+of+Damocles
https://en.wikipedia.org/wiki/Damocles


I don't really get the "one reason" idea and the Sword of Damocles
analogy.  The later to me is more like "there's always a downside",
since the perks of being king may mean someone might try to usurp the
throne and kill you.  Where is the "single principle" aspect?

However, the idea of "one responsibility" in the sense of "do only one
thing" seems relatively clear, especially if I think in terms of writing
unit tests.

+1

Users are notoriously unable to write clear specifications. Ultimately,and not necessarily unkindly, they often do not know what they want -and particularly not specified at the level of detail we need. This isthe downfall of the "waterfall" SDLC development model, and the reasonwhy short 'Agile' sprints are (often/in-theory) more successful. Thesooner a mistake, misunderstanding, or omission is realised, the better!

Not wanting to do more than state the above, things change, life goeson, etc, etc. So, when one first demonstrates code to a user, it is rarethat they won't want to change something. Similarly, over time, it ishighly unlikely that someone won't dream up some 'improvement' - orsimilar need to amend the code be imposed by an externality, eggovernment legislation or regulation. Such changes may be trivial, egcosmetic; others may be more far-reaching, eg imposition of a tax wherepreviously there was none (or v-v).

Accordingly, adding the fourth dimension to one's code, andprogram[me]-design - and the advice about being kind to those who willmaintain the code after you or your six-months-time-self.

So, the 'sword of Damocles' is knowing that our code should not beconsidered secure (or static). That change is inevitable. We don't needto be worrying about danger afflicting us when we least expect it - yourusers are unlikely to actually kill you. However, it is a good idea tobe aware that change may be required, and that it could come from anyrandom direction or "stakeholder".

Perhaps worse, change implies risk. We make a change to suit one aspect,and something else 'breaks'. That is the reason many programmersactively resist 'change'. That is also one of the promises of TDD - ifwe can quickly run tests which assure (if not "prove") code-correctness,then the risks of change decrease. Whereas, if there are no tests toensure 'life goes on' after a change, well, unhappiness reigns!


Hence the allusion.
(apologies if it was too oblique)

Accordingly, the idea that if a function does one job (and does itwell), should you decide/be required to change that function, then theimpacts, any side-effects, etc, of such will only affect that onefunction (and its tests), and whatever is 'downstream' (integrating)from there.

Which also reduces the chance that some change 'over here', will havesome unforeseen impact 'over there'. The structure and the flow worktogether to achieve both a degree of separation and a bringing-togetheror unity. The tension of team-work if you like, cf the dysfunction ofdisorganisation.

NB not that a 'small change' means running only that one test in TDD -still run the entire suite of tests (auto-magically) to ensure thatthere are no surprises...

Change is inevitable. Thus, consider that your throne as code-author isillusory - the user/client is able/likely to want change.

Then, if the code is organised according to functionality, the loggingenvironment (for example) can be changed without any need to re-codesomewhere else - and/or that tinkering with cmdLN arguments and suchcode-changes can be done quite separately from the establishment of logging.

Ultimately, the smaller the function (think of establishing logging),the more likely it will be able to be re-used in the next assignmentwhich requires a log! (whereas if it is mixed-in with argparse, re-useis unlikely because the cmdLN args will be different)

There's plenty of references about such on the web. Will be happy todiscuss whichever sub-topics might be of-interest, further...


--
Regards,
=dn
--
https://mail.python.org/mailman/listinfo/python-list

Re: When is logging.getLogger(__name__) needed?

Reply via email to

Re: When is logging.getLogger(name) needed?