On Tue, Apr 05, 2016 at 11:09:30AM -0700, Alexei Starovoitov wrote:
> >>@@ -67,6 +69,14 @@ perf_trace_##call(void *__data, proto)                   
> >>                \
> >>                                                                    \
> >>    { assign; }                                                     \
> >>                                                                    \
> >>+   if (prog) {                                                     \
> >>+           *(struct pt_regs **)entry = __regs;                     \
> >>+           if (!trace_call_bpf(prog, entry) || hlist_empty(head)) { \
> >>+                   perf_swevent_put_recursion_context(rctx);       \
> >>+                   return;                                         \
> >>+           }                                                       \
> >>+           memset(&entry->ent, 0, sizeof(entry->ent));             \
> >
> >But if not, you destroy it and then feed it to perf?
> 
> yes. If bpf prog returns 1 the buffer goes into normal ring-buffer
> with all perf_event attributes and so on.
> So far there wasn't a single real use case where we went this path.
> Programs always do aggregation inside and pass stuff to user space
> either via bpf maps or via bpf_perf_event_output() helper.
> I wanted to keep perf_trace_xx() calls to be minimal in .text size
> so memset above is one x86 instruction, but I don't mind
> replacing this memset with a call to a helper function that will do:
>    local_save_flags(flags);
>    tracing_generic_entry_update(entry, flags, preempt_count());
>    entry->type = type;
> Then whether bpf attached or not the ring buffer will see the same
> raw tracepoint entry. You think it's cleaner?

Yeah, otherwise you get very weird and surprising behaviour.

Also, one possible use-case is dynamic filters where the BPF program is
basically used to filter events, although I suppose we already have a
hook for that elsewhere.

Reply via email to