That's a cool idea Leah. Also John - maybe there is a way we can reach out to Amazon Open-Telemetry team? I know Amazon is very, very active in this. This might be a great idea to reach for industry feedback.
I actually also will reach out to my friends in SumoLogic who also are very actively working on OpenTelemetry. J. On Thu, May 26, 2022 at 1:56 AM Leah Cole <colel...@google.com.invalid> wrote: > > I've shared this with some of my colleagues who work in OpenTelemetry and > hopefully they'll be able to chime in in the next few days :) > > On Sun, May 22, 2022 at 12:32 PM Howard Yoo <howard...@gmail.com> wrote: >> >> I agree! :-) looking forward to the Airflow Summit next week! >> Howard >> >> On Sun, May 22, 2022 at 10:29 AM Jarek Potiuk <ja...@potiuk.com> wrote: >>> >>> I think once we get some of the people who commented (Elad/Malthe) >>> confirm that their comments were addressed and maybe some other voices >>> of support, it could be ready for a Voting attempt actually :). I'd >>> wait however with it till after the Summit (like with few other >>> discussions we are having now). >>> >>> J >>> >>> On Sun, May 22, 2022 at 4:53 PM Howard Yoo <howard...@gmail.com> wrote: >>> > >>> > But isn't the span uniquely identified by the task instance and attempt >>> > number? >>> > --> True, but the thing about OpenTelemetry is that span ID and trace ID >>> > are in UUID format (and has to be), so unless we devise a way to uniquely >>> > create the task instance and attempt number into UUID format (I guess we >>> > could kind of do it, technically?), those identification cannot be >>> > directly used, hence the need to persist the span information somewhere >>> > for later retrieval and 'ending' it when the dag run or task instance >>> > ends. >>> > >>> > Yes, the only problem with the logging part with OpenTelemetry right now, >>> > is that the logging was the latest addition to it, and thus will be >>> > subjected to many changes and additions. This AIP does guarantee, >>> > however, that it will include the logging feature, and according to the >>> > opentelemetry docs, logging will be designed and implemented in such a >>> > way that will try to encompass majority of existing logging structures >>> > and schemes since the project understands that logging is a well >>> > established practices. >>> > >>> > Would be more than happy to get this AIP approved and going to address >>> > logging part also. We are still waiting for it to get voted! >>> > >>> > Howard >>> > >>> > On Sat, May 21, 2022 at 2:22 AM Malthe <mbo...@gmail.com> wrote: >>> >> >>> >> On Wed, 18 May 2022 at 16:44, Howard Yoo <howard...@gmail.com> wrote: >>> >> > 2. So, the reason why I ended up implementing span_json was that >>> >> > between the scheduler who submits the tasks to be processed, and the >>> >> > worker that needs to pick them up from the queue (which is implemented >>> >> > in meta database of airflow) - needs to get the current span in some >>> >> > way. It looked like every time the worker gets dagrun or task instance >>> >> > it does so via databases, so in my POC, it was necessary to have means >>> >> > to persist the current 'span' in the database tables. Well, dagrun and >>> >> > task instances do not have anything related to storing spans, so had >>> >> > to implement some method to convert the span objects into json and >>> >> > store them. >>> >> >>> >> But isn't the span uniquely identified by the task instance and attempt >>> >> number? >>> >> >>> >> I would think that you've already got all the information without >>> >> persisting any additional data. >>> >> >>> >> > 3. Yes, I believe the logs will be included into the scope of AIP, >>> >> > even in the draft stage (at least that is what I hope). However, it >>> >> > may be implemented following the initial implementation of metrics and >>> >> > traces. >>> >> >>> >> To me what is the most important and what right now would motivate me >>> >> personally in helping out here is to get task execution logs (both >>> >> worker-based on asynchronous, task deferreds) out using OpenTelemetry. >>> >> That's because right now there is a bit of a broken logging story if >>> >> you're using deferred tasks and distributed logging is really the only >>> >> fix as far as I can tell. >>> >> >>> >> Cheers > > > > -- > > Leah Cole (she/her) | Developer Programs Engineer, Data Analytics | > colel...@google.com | +1 (925) 257-2112 > My working hours may not be your working hours. Please ping me anytime if > you'd like a status update on anything we are working on together - my goal > is to never be a blocker for you. >