On Wed, Oct 29, 2014 at 3:36 AM, Clément Bœsch <u...@pkh.me> wrote:
> On Tue, Oct 28, 2014 at 10:51:27PM +0100, Michael Niedermayer wrote:
> > On Tue, Oct 28, 2014 at 07:16:45PM +0100, Clément Bœsch wrote:
> > > On Tue, Oct 28, 2014 at 06:30:34PM +0100, Stefano Sabatini wrote:
> > > [...]
> > > > How much effort would it take to implement the remaining scaling
> > > > modes?
> > >
> > > According to
> > > https://ffmpeg.org/pipermail/ffmpeg-devel/2014-October/164574.html
> > >
> > > "I think 4x can be done fast enough, but 3x will take time."
> > >
> > > [...]
> > > > > +typedef struct {
> > > > > +    uint32_t rgbtoyuv[1<<24];
> > > >
> > > > We should avoid this 64MiB. Also the table should possibly be
> > > > static, so you don't have to fill it for each xBR instance.
> > >
> > > So, I requested to do it exactly the same as HQx because this part is
> > > common according to the specifications. This should be kept the same
> > > as vf_hqx, and then factorized.
> > >
> > > Now about removing this allocation: I did benchmark this LUT vs
> > > computation (see attached patch for the computation version). And the
> > > problem is that it's slightly slower, probably due to the /1000.
> >
> > Why do you divide at all?
> > Can't you do the computations with full precision?
>
> I wasn't able to... but I was probably doing it wrong.
>
> And anyway, so far I observed this:
>   lut:         127 fps
>   nolut+div:   119 fps
>   nolut+nodiv: 123 fps
>
> So even with the "fast" computation, it's still slower than the LUT. It
> probably doesn't matter that much in practice, and dropping that huge
> table might be worth the performance impact (feel free to discuss).
>
> Note that looking at the original code (which was working on rgb565
> only), it was bit-exact. The 24-bit RGB support was added in the "modern"
> hqx with floating point. So we can probably tolerate the inaccuracy.
> Still, if you find a way of keeping full accuracy with the modern
> implementation...
>
> Typically, I tried stuff like this:
>
>     const uint32_t y = (uint32_t)(( 1225*r + 2404*g +  467*b + (1<<11)) >> 12);
>     const uint32_t u = (uint32_t)(( -692*r - 1356*g + 2048*b + (1<<11)) >> 12) + 128;
>     const uint32_t v = (uint32_t)(( 2048*r - 1716*g -  332*b + (1<<11)) >> 12) + 128;
>
> ...but I'm probably doing it very wrong somewhere (a sign issue maybe?);
> I haven't looked deeper. I went up to 15 bits and it still didn't match,
> so I was probably doing something stupid.
>
> > Also, instead of doing two rgb2yuv conversions and then taking their
> > difference, you can do the difference in RGB space and convert the RGB
> > difference to a YUV difference:
> > it's just aM - bM = (a-b)M
>
> Ah, sounds like a good idea, I guess I'll try that.
>
> [...]
>
> --
> Clément B.

Updated the patch. How should I finally go about converting RGB to YUV?

> _______________________________________________
> ffmpeg-devel mailing list
> ffmpeg-devel@ffmpeg.org
> http://ffmpeg.org/mailman/listinfo/ffmpeg-devel
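As a rough illustration of Michael's aM - bM = (a-b)M suggestion (an
untested sketch, not part of the attached patch; the helper name is made
up): since the RGB->YUV transform is linear, the +128 chroma bias cancels
out in a difference, so the two per-pixel LUT lookups in d() could be
replaced by a single transform of the RGB difference, reusing the same
/1000 coefficients that init() expands to below:

    #include <stdlib.h>

    /* Hypothetical replacement for the two rgb2yuv lookups in d():
     * transform the RGB difference once instead of converting both
     * pixels. The coefficients are the expanded forms of the rg/bg
     * terms used to build the LUT in init(). */
    static int yuv_diff_from_rgb(int r1, int g1, int b1,
                                 int r2, int g2, int b2)
    {
        const int dr = r1 - r2;
        const int dg = g1 - g2;
        const int db = b1 - b2;
        const int dy = ( 299*dr + 587*dg + 114*db) / 1000;
        const int du = (-169*dr - 331*dg + 500*db) / 1000;
        const int dv = ( 500*dr - 419*dg -  81*db) / 1000;

        /* same thresholds as the HQx-style test: 48 for Y, 7 for U, 6 for V */
        return abs(dy) > 48 || abs(du) > 7 || abs(dv) > 6;
    }

Note that integer division truncates toward zero, so this is not
guaranteed to be bit-exact against the difference of two LUT entries; it
only removes the table lookups.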
From 905fb0f15033e447fc344097dd649ca671fa1074 Mon Sep 17 00:00:00 2001
From: Arwa Arif <arwaarif1...@gmail.com>
Date: Thu, 30 Oct 2014 18:05:45 +0530
Subject: [PATCH] lavfi: add xbr filter

---
 doc/filters.texi         |   81 ++----
 libavfilter/Makefile     |    1 +
 libavfilter/allfilters.c |    1 +
 libavfilter/vf_xbr.c     |  319 ++++++++++++++++++++++++++++++++++++++++++++++
 4 files changed, 335 insertions(+), 67 deletions(-)
 create mode 100644 libavfilter/vf_xbr.c

diff --git a/doc/filters.texi b/doc/filters.texi
index 7be29de..253384b 100644
--- a/doc/filters.texi
+++ b/doc/filters.texi
+@section xbr
+Apply the xBR high-quality magnification filter which is designed for pixel
+art. It follows a set of edge-detection rules; see
+@url{http://www.libretro.com/forums/viewtopic.php?f=6&t=134}.
+This filter was originally created by Hyllian.
+
 @anchor{yadif}
 @section yadif

diff --git a/libavfilter/Makefile b/libavfilter/Makefile
index 6d868e7..2c56e38 100644
--- a/libavfilter/Makefile
+++ b/libavfilter/Makefile
@@ -198,6 +198,7 @@ OBJS-$(CONFIG_VIDSTABDETECT_FILTER)          += vidstabutils.o vf_vidstabdetect.o
 OBJS-$(CONFIG_VIDSTABTRANSFORM_FILTER)       += vidstabutils.o vf_vidstabtransform.o
 OBJS-$(CONFIG_VIGNETTE_FILTER)               += vf_vignette.o
 OBJS-$(CONFIG_W3FDIF_FILTER)                 += vf_w3fdif.o
+OBJS-$(CONFIG_XBR_FILTER)                    += vf_xbr.o
 OBJS-$(CONFIG_YADIF_FILTER)                  += vf_yadif.o
 OBJS-$(CONFIG_ZMQ_FILTER)                    += f_zmq.o
 OBJS-$(CONFIG_ZOOMPAN_FILTER)                += vf_zoompan.o

diff --git a/libavfilter/allfilters.c b/libavfilter/allfilters.c
index d88a9ad..2352d44 100644
--- a/libavfilter/allfilters.c
+++ b/libavfilter/allfilters.c
@@ -213,6 +213,7 @@ void avfilter_register_all(void)
     REGISTER_FILTER(VIDSTABTRANSFORM, vidstabtransform, vf);
     REGISTER_FILTER(VIGNETTE,        vignette,        vf);
     REGISTER_FILTER(W3FDIF,          w3fdif,          vf);
+    REGISTER_FILTER(XBR,             xbr,             vf);
     REGISTER_FILTER(YADIF,           yadif,           vf);
     REGISTER_FILTER(ZMQ,             zmq,             vf);
     REGISTER_FILTER(ZOOMPAN,         zoompan,         vf);

diff --git a/libavfilter/vf_xbr.c b/libavfilter/vf_xbr.c
new file mode 100644
index 0000000..1a828d8
--- /dev/null
+++ b/libavfilter/vf_xbr.c
@@ -0,0 +1,319 @@
+/*
+ * This file is part of FFmpeg.
+ *
+ * Copyright (c) 2014 Arwa Arif <arwaarif1...@gmail.com>
+ *
+ * FFmpeg is free software; you can redistribute it and/or
+ * modify it under the terms of the GNU Lesser General Public
+ * License as published by the Free Software Foundation; either
+ * version 2.1 of the License, or (at your option) any later version.
+ *
+ * FFmpeg is distributed in the hope that it will be useful,
+ * but WITHOUT ANY WARRANTY; without even the implied warranty of
+ * MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU
+ * Lesser General Public License for more details.
+ *
+ * You should have received a copy of the GNU Lesser General Public
+ * License along with FFmpeg; if not, write to the Free Software
+ * Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301 USA
+ */
+
+/**
+ * @file
+ * The xBR filter is used for depixelization of images.
+ * It is based on Hyllian's 2xBR shader, v0.2.5.
+ * @see http://www.libretro.com/forums/viewtopic.php?f=6&t=134
+ * Future work: implement the 3x and 4x scale modes, and threading.
+ */
+
+#include "libavutil/opt.h"
+#include "libavutil/avassert.h"
+#include "libavutil/pixdesc.h"
+#include "internal.h"
+
+typedef struct {
+    uint32_t rgbtoyuv[1<<24];
+} xBRContext;
+
+/**
+ * Calculates the weight of difference of two pixels, by transforming these
+ * pixels into their Y'UV parts.
+ * It then uses the threshold used by the HQx filters,
+ * 48*Y + 7*U + 6*V, to give the smooth-looking edges.
+ */
+static int d(AVFrame *in, int x1, int y1, int x2, int y2, const uint32_t *r2y)
+{
+#define YMASK 0xff0000
+#define UMASK 0x00ff00
+#define VMASK 0x0000ff
+
+    int r1, g1, b1, r2, g2, b2;
+    uint32_t c1, c2, yuv1, yuv2;
+
+    /* clamp the coordinates to the frame to avoid out-of-bounds reads at
+     * the borders (the callers go up to 2 pixels outside the frame) */
+    x1 = av_clip(x1, 0, in->width  - 1);
+    y1 = av_clip(y1, 0, in->height - 1);
+    x2 = av_clip(x2, 0, in->width  - 1);
+    y2 = av_clip(y2, 0, in->height - 1);
+
+    r1 = *(in->data[0] + y1 * in->linesize[0] + x1*3);
+    g1 = *(in->data[0] + y1 * in->linesize[0] + x1*3 + 1);
+    b1 = *(in->data[0] + y1 * in->linesize[0] + x1*3 + 2);
+
+    r2 = *(in->data[0] + y2 * in->linesize[0] + x2*3);
+    g2 = *(in->data[0] + y2 * in->linesize[0] + x2*3 + 1);
+    b2 = *(in->data[0] + y2 * in->linesize[0] + x2*3 + 2);
+
+    /* pack as R<<16 | G<<8 | B to match the LUT layout built in init() */
+    c1 = r1<<16 | g1<<8 | b1;
+    c2 = r2<<16 | g2<<8 | b2;
+
+    yuv1 = r2y[c1 & 0xffffff];
+    yuv2 = r2y[c2 & 0xffffff];
+
+    /* cast before abs(): the masked values are unsigned, and the unsigned
+     * difference would wrap around instead of going negative */
+    return abs((int)(yuv1 & YMASK) - (int)(yuv2 & YMASK)) > (48 << 16) ||
+           abs((int)(yuv1 & UMASK) - (int)(yuv2 & UMASK)) > ( 7 <<  8) ||
+           abs((int)(yuv1 & VMASK) - (int)(yuv2 & VMASK)) > ( 6 <<  0);
+}
+
+/**
+ * Mixes a pixel A with pixel B, with B's transparency set to 'a'.
+ * In other words, A is a solid color (bottom) and B is a transparent color (top).
+ */
+static int mix(AVFrame *in, int x1, int y1, int x2, int y2, int a, int color)
+{
+    int col1 = *(in->data[0] + y1 * in->linesize[0] + x1*3 + color);
+    int col2 = *(in->data[0] + y2 * in->linesize[0] + x2*3 + color);
+
+    return (a*col2 + (2-a)*col1) / 2;
+}
+
+/**
+ * Fills the output matrix.
+ */
+static void fill(AVFrame *in, AVFrame *out, int u, int v, int x, int y, int mode, int new_x, int new_y)
+{
+    int r, g, b;
+
+    /* mix the colors if they are not on a boundary */
+    if (mode != 0 && u >= 0 && v >= 0 && u < in->width && v < in->height) {
+        r = mix(in, u, v, x, y, 1, 0);
+        g = mix(in, u, v, x, y, 1, 1);
+        b = mix(in, u, v, x, y, 1, 2);
+    } else {
+        r = *(in->data[0] + y*in->linesize[0] + x*3);
+        g = *(in->data[0] + y*in->linesize[0] + x*3 + 1);
+        b = *(in->data[0] + y*in->linesize[0] + x*3 + 2);
+    }
+
+    /* insert the blended color into the scaled image */
+    *(out->data[0] + new_y*out->linesize[0] + new_x*3)     = r;
+    *(out->data[0] + new_y*out->linesize[0] + new_x*3 + 1) = g;
+    *(out->data[0] + new_y*out->linesize[0] + new_x*3 + 2) = b;
+}
+
+/**
+ * Applies the xBR filter rules.
+ */
+static void apply_edge_detection_rules(AVFrame *in, AVFrame *out, int x, int y, const uint32_t *r2y)
+{
+    /* Matrix: (E is 0,0, i.e. the current pixel)
+     *
+     *        -2 | -1 |  0 | +1 | +2   (x)
+     *      ______________________________
+     *      -2 |      [A1][B1][C1]
+     *      -1 | [A0][ A][ B][ C][C4]
+     *       0 | [D0][ D][ E][ F][F4]
+     *      +1 | [G0][ G][ H][ I][I4]
+     *      +2 |      [G5][H5][I5]
+     *     (y)
+     */
+
+    /* cached pixel weight differences */
+    int d_E_D  = d(in, x,   y,   x-1, y,   r2y);
+    int d_E_B  = d(in, x,   y,   x,   y-1, r2y);
+    int d_E_F  = d(in, x,   y,   x+1, y,   r2y);
+    int d_E_H  = d(in, x,   y,   x,   y+1, r2y);
+    int d_E_G  = d(in, x,   y,   x-1, y+1, r2y);
+    int d_E_C  = d(in, x,   y,   x+1, y-1, r2y);
+    int d_A_D0 = d(in, x-1, y-1, x-2, y,   r2y);
+    int d_A_B1 = d(in, x-1, y-1, x,   y-2, r2y);
+    int d_D_B  = d(in, x-1, y,   x,   y-1, r2y);
+    int d_D_H  = d(in, x-1, y,   x,   y+1, r2y);
+    int d_D_A0 = d(in, x-1, y,   x-2, y-1, r2y);
+    int d_B_F  = d(in, x,   y-1, x+1, y,   r2y);
+    int d_B_A1 = d(in, x,   y-1, x-1, y-2, r2y);
+    int d_E_A  = d(in, x,   y,   x-1, y-1, r2y);
+    int d_E_I  = d(in, x,   y,   x+1, y+1, r2y);
+    int d_C_F4 = d(in, x+1, y-1, x+2, y,   r2y);
+    int d_C_B1 = d(in, x+1, y-1, x,   y-2, r2y);
+    int d_F_H  = d(in, x+1, y,   x,   y+1, r2y);
+    int d_F_C4 = d(in, x+1, y,   x+2, y-1, r2y);
+    int d_B_C1 = d(in, x,   y-1, x+1, y-2, r2y);
+    int d_G_D0 = d(in, x-1, y+1, x-2, y,   r2y);
+    int d_G_H5 = d(in, x-1, y+1, x,   y+2, r2y);
+    int d_H_G5 = d(in, x,   y+1, x-1, y+2, r2y);
+    int d_D_G0 = d(in, x-1, y,   x-2, y+1, r2y);
+    int d_I_F4 = d(in, x+1, y+1, x+2, y,   r2y);
+    int d_I_H5 = d(in, x+1, y+1, x,   y+2, r2y);
+    int d_H_I5 = d(in, x,   y+1, x+1, y+2, r2y);
+    int d_H_I4 = d(in, x,   y+1, x+2, y+1, r2y);
+
+    /**
+     * Note on reading the edge detection rules:
+     *
+     * Each edge rule is an if..else statement; whenever the else branch is
+     * taken, the current pixel color (E) is used to color its edge.
+     *
+     * Each if statement checks whether the sum of weight differences on
+     * the left is less than the sum of weight differences on the right.
+     */
+
+    /* Top Left Edge Detection Rule */
+    if ((d_E_G+d_E_C+d_A_D0+d_A_B1+(4*d_D_B)) < (d_D_H+d_D_A0+d_B_F+d_B_A1+(4*d_E_A))) {
+        /* figure out which color to blend with the current pixel E */
+        if (d_E_D <= d_E_B)
+            fill(in, out, x-1, y, x, y, 1, x*2, y*2);
+        else
+            fill(in, out, x, y-1, x, y, 1, x*2, y*2);
+    } else {
+        /* insert the current pixel color into the output frame */
+        fill(in, out, x, y, x, y, 0, x*2, y*2);
+    }
+
+    /* Top Right Edge Detection Rule */
+    if ((d_E_I+d_E_A+d_C_F4+d_C_B1+(4*d_B_F)) < (d_F_H+d_F_C4+d_D_B+d_B_C1+(4*d_E_C))) {
+        /* figure out which color to blend with the current pixel E */
+        if (d_E_B <= d_E_F)
+            fill(in, out, x, y-1, x, y, 1, (x*2)+1, y*2);
+        else
+            fill(in, out, x+1, y, x, y, 1, (x*2)+1, y*2);
+    } else {
+        /* insert the current pixel color into the output frame */
+        fill(in, out, x, y, x, y, 0, (x*2)+1, y*2);
+    }
+
+    /* Bottom Left Edge Detection Rule */
+    if ((d_E_A+d_E_I+d_G_D0+d_G_H5+(4*d_D_H)) < (d_D_B+d_D_G0+d_F_H+d_H_G5+(4*d_E_G))) {
+        /* figure out which color to blend with the current pixel E */
+        if (d_E_D <= d_E_H)
+            fill(in, out, x-1, y, x, y, 1, x*2, (y*2)+1);
+        else
+            fill(in, out, x, y+1, x, y, 1, x*2, (y*2)+1);
+    } else {
+        /* insert the current pixel color into the output frame */
+        fill(in, out, x, y, x, y, 0, x*2, (y*2)+1);
+    }
+
+    /* Bottom Right Edge Detection Rule */
+    if ((d_E_C+d_E_G+d_I_F4+d_I_H5+(4*d_F_H)) < (d_D_H+d_H_I5+d_H_I4+d_B_F+(4*d_E_I))) {
+        /* figure out which color to blend with the current pixel E */
+        if (d_E_F <= d_E_H)
+            fill(in, out, x+1, y, x, y, 1, (x*2)+1, (y*2)+1);
+        else
+            fill(in, out, x, y+1, x, y, 1, (x*2)+1, (y*2)+1);
+    } else {
+        /* insert the current pixel color into the output frame */
+        fill(in, out, x, y, x, y, 0, (x*2)+1, (y*2)+1);
+    }
+}
+
+static int config_output(AVFilterLink *outlink)
+{
+    AVFilterContext *ctx = outlink->src;
+    AVFilterLink *inlink = ctx->inputs[0];
+
+    outlink->w = inlink->w * 2;
+    outlink->h = inlink->h * 2;
+    return 0;
+}
+
+static int query_formats(AVFilterContext *ctx)
+{
+    static const enum AVPixelFormat pix_fmts[] = {
+        AV_PIX_FMT_RGB24, AV_PIX_FMT_BGR24, AV_PIX_FMT_NONE,
+    };
+
+    ff_set_common_formats(ctx, ff_make_format_list(pix_fmts));
+    return 0;
+}
+
+static int filter_frame(AVFilterLink *inlink, AVFrame *in)
+{
+    AVFilterContext *ctx = inlink->dst;
+    AVFilterLink *outlink = ctx->outputs[0];
+    int i, j;
+    xBRContext *xbr = ctx->priv;
+    const uint32_t *r2y = xbr->rgbtoyuv;
+    AVFrame *out = ff_get_video_buffer(outlink, outlink->w, outlink->h);
+    if (!out) {
+        av_frame_free(&in);
+        return AVERROR(ENOMEM);
+    }
+
+    av_frame_copy_props(out, in);
+    for (i = 0; i < inlink->w; i++)
+        for (j = 0; j < inlink->h; j++)
+            apply_edge_detection_rules(in, out, i, j, r2y);
+
+    out->width  = outlink->w;
+    out->height = outlink->h;
+
+    av_frame_free(&in);
+    return ff_filter_frame(outlink, out);
+}
+
+static int init(AVFilterContext *ctx)
+{
+    xBRContext *xbr = ctx->priv;
+    uint32_t c;
+    int bg, rg, g;
+
+    for (bg = -255; bg < 256; bg++) {
+        for (rg = -255; rg < 256; rg++) {
+            const uint32_t u = (uint32_t)((-169*rg + 500*bg)/1000) + 128;
+            const uint32_t v = (uint32_t)(( 500*rg -  81*bg)/1000) + 128;
+            int startg = FFMAX3(-bg, -rg, 0);
+            int endg   = FFMIN3(255-bg, 255-rg, 255);
+            uint32_t y = (uint32_t)((299*rg + 1000*startg + 114*bg)/1000);
+            c = bg + (rg<<16) + 0x010101 * startg;
+            for (g = startg; g <= endg; g++) {
+                xbr->rgbtoyuv[c] = ((y++) << 16) + (u << 8) + v;
+                c += 0x010101;
+            }
+        }
+    }
+    return 0;
+}
+
+static const AVFilterPad xbr_inputs[] = {
+    {
+        .name         = "default",
+        .type         = AVMEDIA_TYPE_VIDEO,
+        .filter_frame = filter_frame,
+    },
+    { NULL }
+};
+
+static const AVFilterPad xbr_outputs[] = {
+    {
+        .name         = "default",
+        .type         = AVMEDIA_TYPE_VIDEO,
+        .config_props = config_output,
+    },
+    { NULL }
+};
+
+AVFilter ff_vf_xbr = {
+    .name          = "xbr",
+    .description   = NULL_IF_CONFIG_SMALL("Scale the input by 2 using the xBR algorithm."),
+    .priv_size     = sizeof(xBRContext),
+    .inputs        = xbr_inputs,
+    .outputs       = xbr_outputs,
+    .query_formats = query_formats,
+    .init          = init,
+};
--
1.7.9.5
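With the patch applied, the filter takes no options, so a quick way to try
it is (the file names here are just placeholders):

    ffmpeg -i input.png -vf xbr output.png

The result can be compared against plain nearest-neighbor upscaling at the
same factor, e.g. -vf scale=2*iw:2*ih:flags=neighbor.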
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel