Gentoo Websites Logo
Go to: Gentoo Home Documentation Forums Lists Bugs Planet Store Wiki Get Gentoo!
Bug 778035 - media-video/ffmpeg: allow use of sse3 without sse2 as sse3 is an extension of sse2
Summary: media-video/ffmpeg: allow use of sse3 without sse2 as sse3 is an extension of...
Status: RESOLVED FIXED
Alias: None
Product: Gentoo Linux
Classification: Unclassified
Component: Current packages (show other bugs)
Hardware: All Linux
: Normal normal
Assignee: Gentoo Media-video project
URL:
Whiteboard:
Keywords:
Depends on:
Blocks:
 
Reported: 2021-03-24 22:44 UTC by Igor Franchuk
Modified: 2025-03-11 09:03 UTC (History)
0 users

See Also:
Package list:
Runtime testing required: ---


Attachments

Note You need to log in before you can comment on or make changes to this bug.
Description Igor Franchuk 2021-03-24 22:44:09 UTC
ffmpeg ebuilds would fail if CPU_FLAGS_X86 omits sse2 like

CPU_FLAGS_X86="mmx mmxext sse sse3"

with  
The following REQUIRED_USE flag constraints are unsatisfied:
cpu_flags_x86_sse3? ( cpu_flags_x86_sse2 )

which looks wrong because sse3 implies sse2 sse support so it should be something like CPU_FLAGS_X86="mmxext sse3" be enough to compile ffmpeg (correct me if I'm wrong) 

sse3 -> implies (sse2,sse) support (extends sse2)
sse2 -> implies (sse) support (extends sse)

mmxe -> implies (mmx) support (extends mmx)

sse3,sse2, sse co-exists with mmx and mmx3 which are different instruction sets


why then this logic:

	cpu_flags_x86_sse3?  ( cpu_flags_x86_sse2 )
	cpu_flags_x86_sse2?  ( cpu_flags_x86_sse )
	cpu_flags_x86_sse?  ( cpu_flags_x86_mmxext )



Reproducible: Always
Comment 1 Igor Franchuk 2021-03-24 23:02:07 UTC
I meant sse3,sse2, sse co-exists with mmx and mmxext

question: is it possible at all to support sse3 without supporting sse2 and sse? 

sse3 = sse2 + extended instructions
sse2 = sse + extended instructions 

mmxext = mmx + extended instructions 

it's not like sse2 is just extended instructions for sse2 it consists of both 

https://www.intel.com/content/www/us/en/support/articles/000005779/processors.html
Comment 2 Niklāvs Koļesņikovs 2021-03-26 09:12:49 UTC
And avx2 implies the CPU has not only sse3 but also ssse3 and aes and almost certainly also sse4_2. By extending the logic of this bug report, the user needs but specify the highest one. Does this have an actual use case?

It's also worth reminding that the Gentoo recommendation is to set all supported CPU_FLAGS_* as reported by cpuid2cpuflags with the sensible exception being removing ones that either user does not want (e.g. the so called power-virus) or ones that are supported by the host system but may cause trouble with systems to which the built binaries may be distributed to. Speaking of that, FFmpeg has cpudetection that is very handy for that. ;)

Finally, not all ebuilds have sse3 (a rather rare instruction set to see in the wild), so sse2 should be generally set anyway.
Comment 3 Igor Franchuk 2021-03-26 12:47:27 UTC
(In reply to Niklāvs Koļesņikovs from comment #2)
> And avx2 implies the CPU has not only sse3 but also ssse3 and aes and almost
> certainly also sse4_2. 

Yes and it looks like the cpu_flags should only have avx2 in that case

CPU_FLAGS_X86 = 'avx2' 

A portage function like cpu_flags('avx2') could return the full list like 'avx2 sse2, sse, etc). I suggest to transform the CPU_FLAGS_X86 via 

CPU_FLAGS_X86 = cpu_flags(CPU_FLAGS_X86) 

first and then the backward compatibility is met. (uniqueness of fags would be 
provided by cpu_flags function)


> By extending the logic of this bug report, the user
> needs but specify the highest one. Does this have an actual use case?

 I've personally seen a few libs in the portage that won't build with both sse2 and sse3 flags set. As to how often that problem occurs - try googling for: "The following REQUIRED_USE flag constraints are unsatisfied: cpu_flags_x86_sse3? ( cpu_flags_x86_sse2 )"

> It's also worth reminding that the Gentoo recommendation is to set all
> supported CPU_FLAGS_* as reported by cpuid2cpuflags with the sensible
> exception being removing ones that either user does not want (e.g. the so
> called power-virus) or ones that are supported by the host system but may
> cause trouble with systems to which the built binaries may be distributed
> to. Speaking of that, FFmpeg has cpudetection that is very handy for that. ;)
> 
> Finally, not all ebuilds have sse3 (a rather rare instruction set to see in
> the wild), so sse2 should be generally set anyway.

Transforming CPU_FLAGS from make.conf via cpu_flags function before using them should fix the problem like forever. That would give control over the flags to the portage. Less probability for a user to set them wrong. All so that would give an early diagnosis to the wrong set of flags set by a user.

The user would from that moment only have to set for example 

CPU_FLAGS_X86="mmxext sse3"

portage would expands that to CPU_FLAGS_X86="mmx mmxext sse sse3 sse3" via cpu_flags functions, the backward compatibility is met for the ebuilds. No logic problems with choosing CPU_FLAGS_X86 from the user side. 

Still if a user prefers to set CPU_FLAGS_X86="mmx mmxext sse sse3 sse3" he can do it, cpu_flags would just return the same "mmx mmxext sse sse3 sse3" in that case and go on. 

CPU_FLAGS_X86="sse2" would be transformed by cpu_flags to CPU_FLAGS_X86="sse sse2" 

I can't think of any case when this wouldn't work. Portage will take care of sse flags, a cool feature.
Comment 4 Larry the Git Cow gentoo-dev 2025-03-11 09:03:46 UTC
The bug has been closed via the following commit(s):

https://gitweb.gentoo.org/repo/gentoo.git/commit/?id=00eddc65d7bd9ecaa9973fd46e544bbbfe5c9479

commit 00eddc65d7bd9ecaa9973fd46e544bbbfe5c9479
Author:     Ionen Wolkens <ionen@gentoo.org>
AuthorDate: 2025-03-07 13:02:23 +0000
Commit:     Ionen Wolkens <ionen@gentoo.org>
CommitDate: 2025-03-11 09:03:17 +0000

    media-video/ffmpeg: rewrite live ebuild
    
    This does not list all changes compared to original due to being
    essentially a rewrite (not much sense in looking at the diff
    given almost no lines in common), but to list or explain notable
    ones:
    
    * Cleanup a lot of obsolete logic, like mirror://gentoo snapshots,
      SCM type, <openssl-3 handling, etc...
    
    * Re-arrange IUSE=soc handling to drop the IUSE when FFMPEG_SOC_PATCH
      is empty until maintainer of the patch has time to look at that
      (easier to bump without it and removes the need to mask in 9999)
    
    * Drop some IUSE, ffmpeg has a bit too many often confusing users and,
      while debatable, some are not really useful to control bare at most
      for expert user that could use EXTRA_ECONF if *really* wanted:
    
      - debug (disabled): does very little beside override users' -g*,
        albeit if want nasm debug symbols will need to set X86ASMFLAGS.
    
      - encode (enabled): debated keeping it for disabling encoders
        without their own USE for bit of size/build time reduction despite
        no dependencies, but feel it is not really worth it as it
        complicates the ebuild and can break expectations for a lot
        of revdeps (many likely lack the [encode] that they need) --
        removal also handles bug #770439
    
      - fftools_* (disabled): almost no distros install these, many have
        rather generic names, we were missing installing new ones,
        upstream ffmpeg does not even have an install target for them,
        and a global USE_EXPAND for this feels wasteful (*could* consider
        adding a all-in-one IUSE=tools if some are wanted by users anyway
        but let us try without and see if anyone complains)
    
      - glslang (disabled): redundant with USE=shaderc and is a pain to
        use directly given glslang often breaks things, esp. worse for
        packages with a slow release cycle and old LTS versions like
        ffmpeg (bug #918989, bug #920283, bug #922333, bug #922433) --
        upstream also seems to favour shaderc: "libshaderc and libglslang
        are mutually exclusive, if in doubt, disable libglslang"
        (from ./configure)
    
      - harcoded-tables (disabled): off by default upstream and listed as a
        "experts only" option that seemingly provide little gains for a
        notable size increase, do not believe it is worth exposing as a USE
    
      - iconv (enabled): virtual/libiconv is essentially free anywhere but
        prefix and disabling iconv can be rather broken for a few things
    
      - network (enabled): no dependencies, fairly small, and many revdeps
        likely expect it to be enabled and just missing the [network]
        (tempting to do the same with USE=postproc but it would still be
        conditional through USE=gpl and be more confusing, and then forcing
        GPL also has its own issues)
    
      - oss (auto): does not have a dedicated option and just checks
        linux-headers and is very small, hardly anyone need this
        but that argument could be applied to a *ton* of in/outdevs,
        filters, de/encoders, and such and we would never see the end
        of it
    
      - pic (enabled): makes little sense not to enable nowadays and
        handles bug #903754 + bug #916067 -- USE had the secondary
        effect of disabling x86-32 asm which still doing given it is
        difficult to support either way due to textrels and
        >=binutils-2.41's enforcing (bug #916067, bug #920808), and
        it is possibly broken for older hardware (bug #733556) -- as
        a minor bonus, this should also let LTO work with abi_x86_32
        (bug #654118)
    
      - static-libs (disabled): cannot think of much of a use for this
        esp. given how massive these libraries are and it would still
        require to link with all its shared-only dependencies
    
      - test (irrelevant): tests are not gone but there is no dependencies
        (bc and wget are not needed) nor REQUIRED_USE to handle anymore,
        thus USE is useless
    
      - threads (auto): no real sense in disabling (bug #868471), and
        leaving it auto let ffmpeg figure out the right implementation
        to use which in most cases means default --enable-pthreads
    
      - cpu_flags_* + cpudetection (enabled): tentatively anyway, just
        let ffmpeg do its thing normally -- simplifies the ebuild a lot,
        "should" fix bug #915384, and also handles whatever ebuild
        was doing wrong wrt bug #752507 + bug #778035 (one minor "issue"
        however is that all these flags being removed will leave users
        confused as to what happened, if reading this and worried about
        optimizations.. then no they are *not* disabled, and it is fine
        for other machines thanks to detection at runtime)
    
    * New IUSE:
    
      - openmpt: requested in bug #921313
    
      - npp: requested in bug #695210 -- note have not tested the
        arm64 (sbsa) support, but it should work in theory as cuda
        toolkit does install npp prebuilt arm64 libraries
    
      - rabbitmq: trivial, may as well add support
    
      (meant to add more but others did not fare as well and will leave
      for later rather than during big changes, kept some minor notes in
      the ebuild about things still not supported)
    
    * Renamed IUSE:
    
      - mp3->lame: this is only for encoding but it made it "sound"
        needed for decoding (not the only case like this but this one
        felt too generic), may surprise those that had it actually
        enabled for encoding
    
      - libxml2->xml and libdrm->drm: given newly enabling these by
        default (see below), it's a good occasion to rename to the
        more commonly used USE without causing issues
    
      - libv4l->v4l: to be precise IUSE=v4l was removed and it gained
        the meaning that IUSE=libv4l had -- old v4l is always enabled
        (if linux) due to the hard dependency on linux-headers (see below)
        -- given the old REQUIRED_USE with libv4l? ( v4l ), all users of
        libv4l will have the new USE enabled already
    
      - libdvdnav+libdvdread->dvd(combined): similar to mpv, simpler
        for users when they just want dvds to work than worry about
        each library (dvd support is new in ffmpeg-7, so this only
        ever been masked)
    
    * Enable a few extra IUSE by default that, while not everyone needs,
      are useful and have dependencies that many users will already have.
    
      - libass,truetype,fontconfig: for proper text/subtitles drawing
        (kdenlive also wants [libass]), albeit video players tend to
        have their own handling (that pull the same depdencies)
    
      - drm: used in various pieces of code and little reason to
        disable when mesa depends on libdrm either way
    
      - xml: needed for DASH-type streams, notably used by youtube,
        and about everyone has libxml2
    
      (one downside of new defaults is MULTILIB_USEDEP, users that only
      selectively enable abi_x86_32 rather than globally may be faced with
      conflicts when updating)
    
    * Always depend on linux-headers if linux, not *required* (except for
      v4l) but this prevents a lot of automagic and going through alternate
      code paths if missing (also go with >=6 which is needed for v4l2m2m
      and I believe also USE=soc for v4l2request)
    
    * Add missing vulkan-headers dependency wrt bug #865201, and also
      opencl-headers wrt bug #941506 (not 100% sure that this is what
      happened for the latter, but given binhost is setup it is likely).
    
    * Bump nv-codec-headers min req to 12.1.14.0, while not *required*
      it spares us for setting up ranges and 11.x is due for cleanup.
    
    * Add missing x11-base/xorg-proto wrt bug #934196 -- technically
      was longer an issue given libX11 RDEPENDs on it "now", but would
      still rather keep these for now.
    
    * Depend on >=librsvg-2.52 wrt bug #918613, USE=svg is not essential
      for arches without rust and upstream bug been went 1.5 years without
      activity.
    
    * Update REQUIRED_USE a bit, cuda does not *need* nvenc (just get
      one filter in less), libplacebo does nothing without sdl(ffplay)
      or vulkan, and few others like these (likely still missing some).
    
    * Drop musl HAVE_POSIX_IOCTL workaround, this is automatically set by
      ./configure since ffmpeg-7.1. However, note that there is another
      related issue with libv4l (not v4l) that is not fixed (bug #911499).
    
    * No longer ignore ffmpeg-spec-disposition test, fixed in 7.1.1.
    
    * Fix verify-sig paths on prefix wrt bug #947253 (untested)
    
    * No longer pass e.g. --disable-indev=alsa when already doing
      --disable-alsa, ./configure seem to figure this out just fine.
    
    * No longer pass --cpu, this expects the -march=<exact> reported by
      the compiler or probes compiler with =native if =host), however it
      does not account for -mno-* both passed or implied by =native and
      could enable things it shouldn't. Furthermore, it adds -march=<exact>
      after the users' -march=native (which is more correct, can enable
      or disable additional features). This option only seem like trouble
      and likely does not get tested much. Removing also handles
      bug #830165 albeit ffmpeg-chromium is still affected.
    
    * Replace EXTRA_FFMPEG_CONF by EXTRA_ECONF, while it is not using
      econf nor autoconf, the latter is more widely known while the former
      needs users to look at the ebuild -- also die if old variable is set
      so users know about this change.
    
    * Skip installing doc/*.txt, that seems mostly oriented at ffmpeg devs,
      and feels noisy in the docdir, but do keep doc/APIChanges
    
    * Replace all *GPL* by the + version given ./configure always output
      with "or later" when listing the final license.
    
    * Maybe not best way to go about this, but when doing incompatible
      --enable-gpl --enable-nonfree, on top of the RESTRICT="bindist"
      also add "all-rights-reserved" so a typical freedom ACCEPT_LICENSE
      can block it.
    
    * Rearrange chromium patch a bit to need less ebuild logic, and make
      it unused for ffmpeg-compat.
    
    * Drop EGIT_MIN_CLONE_TYPE, shallow clones seem fine now? If not,
      could consider just using the official github mirror by default.
    
    List of closed bugs may not be complete, plans are to recheck
    still-open bugs after this is merged.
    
    Bug: https://bugs.gentoo.org/830165
    Closes: https://bugs.gentoo.org/654118
    Closes: https://bugs.gentoo.org/695210
    Closes: https://bugs.gentoo.org/733556
    Closes: https://bugs.gentoo.org/752507
    Closes: https://bugs.gentoo.org/770439
    Closes: https://bugs.gentoo.org/778035
    Closes: https://bugs.gentoo.org/865201
    Closes: https://bugs.gentoo.org/868471
    Closes: https://bugs.gentoo.org/903754
    Closes: https://bugs.gentoo.org/915384
    Closes: https://bugs.gentoo.org/916067
    Closes: https://bugs.gentoo.org/918613
    Closes: https://bugs.gentoo.org/918989
    Closes: https://bugs.gentoo.org/920283
    Closes: https://bugs.gentoo.org/920808
    Closes: https://bugs.gentoo.org/921313
    Closes: https://bugs.gentoo.org/922333
    Closes: https://bugs.gentoo.org/922433
    Closes: https://bugs.gentoo.org/934196
    Closes: https://bugs.gentoo.org/941506
    Closes: https://bugs.gentoo.org/947253
    Signed-off-by: Ionen Wolkens <ionen@gentoo.org>

 media-video/ffmpeg/ffmpeg-9999.ebuild      | 1014 +++++++++++++---------------
 media-video/ffmpeg/files/chromium-r3.patch |   53 ++
 media-video/ffmpeg/metadata.xml            |    8 +
 3 files changed, 538 insertions(+), 537 deletions(-)