36 comments

  • qbow88311 hours ago
    Days since last ffmpeg CLI wrapper: 0<p>It&#x27;s incredible what lengths people go to to avoid memorizing basic ffmpeg usage. It&#x27;s really not that hard, and the (F.) manual explains the basic concepts fairly well.<p>Now, granted, ffmpeg&#x27;s defaults (reencoding by default and only keeping one stream of each type unless otherwise specified) aren&#x27;t great, which can create some footguns, but as long as you remember to pass `-c copy` by default you should be fine.<p>Also, <i>hiding</i> those footguns is likely to create more harm than it fixes. Case in point: &quot;ff convert video.mkv to mp4&quot; (an extremely common usecase) maps to `ffmpeg -i video.mkv -y video.mp4` here, which does a full reencode (losing quality and wasting time) for what can usually just be a simple remux.<p>Similarly, &quot;ffmpeg extract audio from video.mp4&quot; will unconditionally reencode the audio to mp3, again losing quality. The quality settings are also hardcoded and hidden from the user.<p>I can sympathize with ffmpeg syntax looking complicated at first glance, but the main reason for this is just that multimedia is <i>really complicated</i> and that some of this complexity is <i>necessary</i> in order to not make stupid mistakes that lose quality or waste CPU resources. I truly believe that these ffmpeg wrappers that try to make it seem overly simple (at least when it&#x27;s <i>this</i> simple, i.e. not even exposing quality settings or differentiating between reencoding and remuxing) are more hurtful than helpful. Not only can they give worse results, but by hiding this complexity from users they also give users the wrong ideas about how multimedia works. &quot;Abstractions&quot; like this are exactly how beliefs like &quot;resolution and quality are the same thing&quot; come to be. I believe the way to go should be <i>educating</i> users about video formats and proper ffmpeg usage (e.g. with good cheat sheets), not by hiding complexity that really should not be hidden.<p>Edit: Reading through my comment again, I have to apologize for the slightly facetious opening statement, even if I quality it later on. The fact that so many ffmpeg wrappers exists <i>is</i> saying something about its apparent difficulty, but as I argue above, a) there are reasons for this (namely, multimedia itself just being complicated), and b) I believe there are good and bad ways to &quot;fix&quot; this, with oversimplified wrappers being more on the &quot;bad&quot; side.
    • Tempest19818 hours ago
      &gt; It&#x27;s really not that hard,<p>I&#x27;ve learned not to say this. Different things are easy&#x2F;hard for each of us.<p>Reminds me of a discussion where someone argued, &quot;why don&#x27;t all the poor&#x2F;homeless people just go get good jobs?&quot;<p>Edit: I know your comment was meant to inspire&#x2F;motivate us to try harder. Maybe it&#x27;s easier than it appears.
      • there_is_try8 hours ago
        Empathy is really not that hard.
      • MattDaEskimo6 hours ago
        I would agree with this statement before LLMs. Reading manuals can take time, be messy, and are sometimes hard to understand.<p>Now, I can simply ask any LLM to write the command, and understand any following issues or questions.<p>For example, my OS records videos as WEBM. Using the default settings for transforming to MP4 usually fails from a resolution ratio issue. I would be deadlocked using this library.<p>It really isn&#x27;t that hard anymore.
      • ThrowawayTestr2 hours ago
        ChatGPT is pretty good at generative commands
    • juujian9 hours ago
      Yes, I use ffmpeg about once a year, in about 350 years I really ought to have all the syntax figure out.
    • ninalanyon8 hours ago
      &gt; It&#x27;s really not that hard,<p>if you are doing it often that&#x27;s true. But for people like me who do it once every month or two it really is hard to memorize, especially if it&#x27;s not exactly the same task.<p>What I would love would be an interactive script that asked me what I was trying to do and constructed a command line for me while explaining what it would do and the meaning of each argument. And of course it should favour commands that do not re-encode where possible.
      • navane13 minutes ago
        I also use ffmpeg once a month. My new plan: build my own scripts like the ones in op. But self built, only for that operation or three that I do.
      • crazygringo4 hours ago
        I swear I want this as a general tool for all command-line tools.<p>Start the tool, and just list all of the options in order of usage popularity to toggle on as desired, with a brief explanation, and a field to paste in arguments like filenames or values when needed. If an option is commonly used with another (or requires it), provide those hints (or automatically add the necessary values). If a value itself has structure (e.g. is itself a shell command), drill down recursively. Ensure that quotes and spaces and special characters always get escaped correctly.<p>In other words, a general-purpose command-line builder. And while we&#x27;re at it, be able to save particular &quot;templates&quot; for fast re-use, identifying which values should be editable in the future.<p>I can&#x27;t be the first person to think of this, but I&#x27;ve never come across anything like it and don&#x27;t understand why not. It doesn&#x27;t require AI or anything. Maybe it&#x27;s the difficulty involved in creating the metadata for each tool, since man pages aren&#x27;t machine-readable. But maybe that&#x27;s where AI can help -- not in the tool itself, but to create the initial database of tool options, that can then be maintained by hand?<p>(Navi [1] does the templating part, but not the &quot;interactive builder&quot; part.)<p>[1] <a href="https:&#x2F;&#x2F;github.com&#x2F;denisidoro&#x2F;navi" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;denisidoro&#x2F;navi</a>
        • darrenf3 hours ago
          I’m trying to understand the “In order of usage popularity” thing — this implies telemetry in CLIs, doesn’t it? Wouldn’t the order of options change&#x2F;fluctuate over time?<p>Or if no telemetry but based on local usage, it would promote&#x2F;reinforce the options you already can recall and do use, hiding the ones you can’t&#x2F;don’t?
          • crazygringo1 hour ago
            You could make it opt-in telemetry in the tool itself, that would probably be good enough.<p>But also, you could probably be just as accurate by asking an LLM to order the options by popularity based on their best guess based on all the tutorials they&#x27;ve trained on.<p>Or just scrape Stack Overflow for every instance of a command-line invocation for each tool and count how many times each option is used.<p>Ranking options by usage is the least complicated part of this, I think. (And it only matters for the popular options anyways -- below a certain threshold they can just be alphabetical.)
            • zahlman1 hour ago
              &gt; But also, you could probably be just as accurate by asking an LLM to order the options by popularity based on their best guess based on all the tutorials they&#x27;ve trained on.<p>&gt; Or just scrape Stack Overflow for every instance of a command-line invocation for each tool and count how many times each option is used.<p>Even trusting the developer&#x27;s intuition is better than nothing, at least if you make sure the developer is prompted to <i>think</i> about it. (For major projects, devs might also be aware that certain features are associated with a large fraction of issue reports, for example.)
          • reassess_blind2 hours ago
            Just do a best-guess list. Or do a survey. Or just scrape the most common features used across Github repos.
      • larodi7 hours ago
        Indeed why not have —tui option and some basic menu? Even a simplified scripting with reasonable API would be better.<p>I find myself bothering exactly zero times to memorise this obnoxiously long command line. Claude fills in, and I can explore features better. What’s not to like? That I’m getting dumber for not memorising pages of cli args?<p>Love the project, but as with every Swiss knife this conversation is a thing and relevant. We had similar one reg JQ syntax and I’m truly convinced JQ is wonderful and useful tool. But I’m not gonna bother learning more DSLs…
      • magicalhippo4 hours ago
        And they change quite frequently, from our POV.<p>That said, I started wrtiting scripts when I use ffmpeg some time ago. At least then I have a non-zero starting point next time.
    • mhuffman6 hours ago
      &gt;It&#x27;s really not that hard<p>It is only a couple of thousand options[0], just memorize them! It super simple, barely an inconvenience!<p>[0]<a href="https:&#x2F;&#x2F;gist.github.com&#x2F;tayvano&#x2F;6e2d456a9897f55025e25035478a3a50" rel="nofollow">https:&#x2F;&#x2F;gist.github.com&#x2F;tayvano&#x2F;6e2d456a9897f55025e25035478a...</a>
    • zahlman1 hour ago
      &gt; It&#x27;s incredible what lengths people go to to avoid memorizing basic ffmpeg usage. It&#x27;s really not that hard, and the (F.) manual explains the basic concepts fairly well.<p>I&#x27;m usually the one telling everyone else that various Python packaging ecosystem concepts (and possibly some other things) are &quot;really not that hard&quot;. Many FFMpeg command lines I&#x27;ve encountered come across to me like examples of their own, esoteric programming language.<p>&gt; Case in point: &quot;ff convert video.mkv to mp4&quot; (an extremely common usecase) maps to `ffmpeg -i video.mkv -y video.mp4` here, which does a full reencode (losing quality and wasting time) for what can usually just be a simple remux.... Similarly, &quot;ffmpeg extract audio from video.mp4&quot; will unconditionally reencode the audio to mp3, again losing quality.<p>That sounds like a bug report &#x2F; feature request rather than a problem with the approach.<p>&gt; The quality settings are also hardcoded and hidden from the user.<p>This is intended so that users don&#x27;t have to understand what quality settings are available and choose a sensible default.<p>&gt; and that some of this complexity is necessary in order to not make stupid mistakes<p>For example, the case of avoiding re-encodes to switch between container formats could be handled by just maintaining a mapping.<p>In fact, I&#x27;ve felt the lack of that mapping recently when I wanted to extract audio from some videos and apply a thumbnail to them, because different audio formats have different rules for how that works (or you might be forced to use some particular container format, and have to research which one is appropriate).
    • BeetleB7 hours ago
      &gt; It&#x27;s incredible what lengths people go to to avoid memorizing basic ffmpeg usage. It&#x27;s really not that hard<p>It&#x27;s not hard - just not a good use of our time. For 99% of HN users, ffmpeg is not a vital tool.<p>I have to use it less than twice a year. Now I just go and get an LLM to tell me the command I need.<p>And BTW, I spend a lot of time memorizing things (using spaced repetition). So I&#x27;m not averse to memorizing. ffmpeg simply doesn&#x27;t warrant a place in my head.
    • e-Minguez10 hours ago
      If you use it from time to time it would be very challenging to remember the million of different options ffmpeg has.
    • WhitneyLand9 hours ago
      “It&#x27;s really not that hard”<p>I’m going to guess your job does not involve much UX design?
      • qbow8839 hours ago
        I&#x27;m not saying it couldn&#x27;t be better (and I even gave examples), my point is that the drawbacks of such a wrapper outweigh the benefits, at least when it&#x27;s such an oversimplified one. I&#x27;ve said in other replies how I&#x27;d be very interested in e.g. an alternative libav* frontend with better defaults and more consistent argument syntax, but I don&#x27;t think that this invalidates my criticism of the linked project.
      • christstopit7 hours ago
        [dead]
    • koyote3 hours ago
      You&#x27;re getting a lot of flak due to how you started off your comment, but I mostly agree with you.<p>In my opinion there are two kinds of users: 1. Users who use FFmpeg regularly enough to know&#x2F;understand the parameters. 2. Users who only use FFmpeg once in a while to do something specific.<p>This wrapper is superfluous for users in group number 1. But group number 2 does not really get much out of it either, for the reasons you&#x27;ve mentioned.<p>As a member of group 2, I usually want to do something very specific (e.g. remove an audio track, convert only the video, remux to a different container, etc.). A simple English wrapper does not help me here because it is not powerful enough; the defaults are usually not what I want. What I need is a tool that will take a more detailed English statement of what I want to achieve and spit out the FFmpeg command with explanations for what each parameter does and how it achieves my goal. We have this today: AI; and it mostly works (once you&#x27;ve gone through several iterations of it hallucinating options that do not exist...).
      • qbow8832 hours ago
        Thank you, this explains my thoughts really well.
    • ubercow135 hours ago
      Totally disagree, I have a wrapper I wrote myself for converting things, often for sharing the odd little clip online or such. It produces a <i>complex</i> command that is not easy to just type out, that does multiple things to maximise compatibility like<p>- making sure pixel are square while resizing if the video resolution is too large<p><pre><code> (&quot;scale=w=if(gt(iw*sar\\,ih)\\,min(ceil(iw*sar&#x2F;2)*2\\,{})\\,ceil(iw*sar*min(ih\\,{})&#x2F;ih&#x2F;2)*2):h=if(gt(ih\\,iw*sar)\\,min(ceil(ih&#x2F;2)*2\\,{})\\,ceil(ih*min(iw*sar\\,{})&#x2F;iw&#x2F;sar&#x2F;2)\*2):out_range=limited,zscale,setsar=1&quot;) </code></pre> - dealing with some HDR or high gamut thing I can&#x27;t really remember that can result from screen recording on macos using some method I was using at some point<p>- setting this one tag on hevc files that macos needs for them to be recognised as hevc but isn&#x27;t set by default<p>- calculating the target bitrate if I need a specific filesize and verifying the encode actually hit that size and retrying if not (doesn&#x27;t always work first time with certain hardware encoders even if they have a target or max bitrate parameter)<p>- dealing with 2-pass encoding which is fiddly and requires two separate commands and the parameters are codec specific<p>- correctly activating hardware encoding for various codecs<p>- etc<p>And this is just for the basic task of &quot;make this into a simple mp4&quot;
      • qbow8835 hours ago
        Yes, absolutely. Multimedia is complicated.<p>But my issue with the linked tool is that it does <i>none</i> of the things you mentioned. All it does it make already very easy things even easier. Is it really that much harder to remember `ffmpeg -i inputfile outputfile.ext` than `ff convert inputfile to ext`?<p>I&#x27;ve explained this in other replies here but I am neither saying that ffmpeg wrappers are automatically bad, nor that ffmpeg cannot be complicated. I am only saying that <i>this specific tool</i> does not really help much.
    • Gud7 hours ago
      “It’s really not that hard”, well a lot of people have better things to do than remember parameters to commands we barely use.
    • Forgeties7910 hours ago
      Some people just want to use an intuitive tool with better QoL, even if it leads to compromises, to do a job swiftly without going over documentation&#x2F;learning a ton of new things. Not everything has to be an educational experience. ffmpeg exists in its original form like you prefer, but some folks want to use lossless cut. Nothing wrong with that IMO.<p>Personally I think it’s great that it’s such a universally useful tool that it has been deployed in so many different variations.
      • hnarn10 hours ago
        &gt; Some people just want to use a tool to do a job swiftly. Not everything has to be educational.<p>&gt; some folks want to use lossless cut<p>In that case I would encourage you to ruminate on what the following in the post you&#x27;re replying to means and what the implications are:<p>&gt; &quot;ff convert video.mkv to mp4&quot; (an extremely common usecase) maps to `ffmpeg -i video.mkv -y video.mp4` here, which does a full reencode (losing quality and wasting time) for what can usually just be a simple remux<p>Depending on the size of the video, the time it would take you to &quot;do the job swiftly&quot; (i.e. not caring about how the tools you are using actually work) might be more than just reading the ffmpeg manual, or at the very least searching for some command examples.
        • foodevl9 hours ago
          &gt; &gt; some folks want to use lossless cut &gt; In that case I would encourage you to ruminate on what the following in the post you&#x27;re replying to means and what the implications are:<p>You may have misunderstood the comment: &quot;lossless cut&quot; is the name of an ffmpeg GUI front end. They&#x27;re not discussing which exact command line gives lossless results.
        • wpm9 hours ago
          The thing is that when a video is being re-encoded, so long as I&#x27;m not trying to play games on my computer at the same time, I&#x27;m free to go do something else. It does not command any of my attention while its working, whereas sitting and reading the man pages commands my attention absolutely.
        • Forgeties799 hours ago
          As the other person said (and this is my mistake for not capitalizing), Lossless Cut is a popular CLI wrapper for ffmpeg with a (somewhat) intuitive interface. Someone is going to be able to pick up and use that a lot faster than they are ffmpeg. I think a lot of folks forget how daunting most people find using a terminal, yet a lot of those people still want something to do a simple lossless trim of an existing video or some other little tweak. It’s good that they have both options (and more).
          • leephillips7 hours ago
            Looks like a GUI wrapper in fact, no?
            • Forgeties796 hours ago
              Yes thank you I can get a little clumsy with my acronyms. Downside of not being a proper coder&#x2F;engineer!
              • leephillips6 hours ago
                No problem. I only asked because if there <i>were</i> a CLI version I wanted to know about it.
      • qbow88310 hours ago
        Yes, I am not opposed to ffmpeg wrappers in and of themselves. Some decent ffmpeg wrappers definitely exist. But I argue in my comment above that this <i>specific</i> tool does <i>not</i> have better QoL - again, since it reencodes unconditionally with quality settings that are usually not configurable.
        • Forgeties799 hours ago
          &gt; Days since last ffmpeg CLI wrapper: 0<p>&gt;It&#x27;s incredible what lengths people go to to avoid memorizing basic ffmpeg usage. It&#x27;s really not that hard, and the (F.) manual explains the basic concepts fairly well.<p>Not really sure how else I was supposed to interpret your comment but clarification taken.<p>&gt; But I argue in my comment above that this specific tool does not have better QoL<p>For some folks it may be better&#x2F;more intuitive. It doesn’t hurt anybody by existing.<p>We all compromise with different tools in our lives in different ways. It just reads to me like an odd axe to grind.<p>Simply put: What is so bad about the existence of this project?
          • qbow8838 hours ago
            &gt; Not really sure how else I was supposed to interpret your comment<p>Yes, that was a bit facetious of me, I apologize for that.<p>&gt; What is so bad about the existence of this project?<p>Being very blunt: The fact that it reinforces the <i>extremely common</i> misconception that a) converting between containers like mkv and mp4 will always require reencoding and that b) there is a single way to reencode a video (hence suggesting that there is no &quot;bad&quot; way to reencode a video), seeing as next to no encoding settings are exposed.
            • christstopit7 hours ago
              You are overthinking this way too much, to the point that it is sounding like you are purposefully creating out-of-context problems to justify your way too long rant.<p>As the kids these days say: just take the L, man.
            • Forgeties798 hours ago
              I get what you’re saying but at the end of the day you just need to think about how most people use a tool like this. They’re looking for a simple solution to some specific problem and then they’re likely never using it again. They don’t want to deal with a full-on NLE and iMovie or whatever they have stocked is not cutting it. It’s not worth getting bent out of shape about it ultimately. There are tons of people who use ffmpeg as intended in its original form and more or less understand everything that is going on. The reason we have so many wrappers and variations all centered around ffmpeg is because of how useful it is, so it’s clearly here to stay.<p>I personally use lossless cut more than ffmpeg in the terminal just because I don’t have to really think about it and it can do most of what I need, which is simply removing or attaching things together without re-encoding. I use it maybe once every month or two, because it’s just not something I need to use a ton, so it doesn’t make sense for me to get down and dirty with the original. Ultimately I get what I need and I’m happy!
    • msla2 hours ago
      You know, writing code that doesn&#x27;t leak memory is really not that hard.<p>There. I&#x27;ve debunked Java, Python, PHP, Perl, and Rust.<p>(Or maybe, just maybe, tools should make our lives easier.)
    • zzzeek4 hours ago
      sure here&#x27;s a command that a program I wrote to record my practicing and produce different mixes uses<p><pre><code> &#x2F;usr&#x2F;bin&#x2F;ffmpeg -i &quot;&#x2F;path&#x2F;to&#x2F;musicfile.mp3&quot; -i &quot;&#x2F;path&#x2F;to&#x2F;covertune.mp3&quot; \ -filter_complex [1:a]volume=1[track1];[0a][track1]amix=normalize=false[output] \ -map [output] -b:a 192k -metadata title=15:17:01 -metadata &quot;artist=Me, 2025&quot; \ -metadata album=2025-12-23 &quot;&#x2F;path&#x2F;to&#x2F;file.mix.mp3&quot; </code></pre> chance of my coming up with that without deep poring over docs and tons of trial and error, or using claude (which is pretty much what I do nowadays): zero
      • qbow8832 hours ago
        But the chances of you being able to achieve the same with the linked tool are also zero. That&#x27;s all I am really saying. I&#x27;m not arguing that ffmpeg can get very complex (I was talking about &quot;basic&quot; ffmpeg usage in my original comment), just that `ff convert inputfile to ext` is not really simpler than `ffmpeg -i inputfile -o outputfile.ext`, which is all that this (<i>this specific</i>) tool is really doing.
        • zzzeek32 minutes ago
          Oh, well yes the ff tool shown here is a classic 80% kind of thing for sure . Claude OTOH will get you about 98% and can explain the options to you as well
    • kristopolous10 hours ago
      so you know how to swap audio with -map without having to look it up?
      • qbow88310 hours ago
        I do, yes. Though that&#x27;s not really the point, it&#x27;d already be enough to know <i>where</i> to look it up.
        • kristopolous10 hours ago
          no the point is that there are some things I&#x27;ve done a hundred times and I never remember it because it&#x27;s designed in a wildly bad way. ffmpeg, gpg, openssl and git has those things all over the place. Is it -c:v or -v:c? I don&#x27;t know. used to be -vcodec so it&#x27;s -v:c now? no it&#x27;s -c:v I think because they swapped it?<p>There isn&#x27;t internal consistency to really hold on to ... it&#x27;s just a bunch of seemingly independent options.<p>The biggest problem is open source teams really don&#x27;t get people on board that focus on customer and product the way commercial software does. This is what we get as a result
          • qbow88310 hours ago
            &gt; Is it -c:v or -v:c?<p>Sure, I agree with all of this. Like I said above, the syntax (and, even more, the defaults) isn&#x27;t great. I&#x27;m just arguing that &quot;improving the syntax&quot; should not mean &quot;hiding complexity that should not be hidden&quot;, as the linked project does. An alternative ffmpeg frontend (i.e. a new CLI frontend using the libav* libraries like ffmpeg is, not a wrapper for the ffmpeg CLI program) with better syntax and defaults but otherwise similar capabilities would be a very interesting project.<p>(The answer to your question is that both -vcodec and -c:v are valid, but I imagine that&#x27;s not the point.)<p>&gt; The biggest problem is open source teams really don&#x27;t get people on board that focus on customer and product the way commercial software does.<p>I believe in this case it may be more of a case of backwards compatibility, with options being added incrementally over time to add what was needed at the moment. Though that&#x27;s just my guess.
            • kristopolous10 hours ago
              ffmpeg doesn&#x27;t go away. it&#x27;s still there. people can use tig and git, having something that isn&#x27;t insane can live in harmony with the other thing.
    • christstopit7 hours ago
      [dead]
    • fleroviumna9 hours ago
      [dead]
  • dllu15 hours ago
    When converting video to gif, I always use palettegen, e.g.<p><pre><code> ffmpeg -i input.mp4 -filter_complex &quot;fps=15,scale=640:-2:flags=lanczos,split[a][b];[a]palettegen=reserve_transparent=off[p];[b][p]paletteuse=dither=sierra2_4a&quot; -loop 0 output.gif </code></pre> See also: this blog post from 10 years ago [1]<p>[1] <a href="https:&#x2F;&#x2F;blog.pkh.me&#x2F;p&#x2F;21-high-quality-gif-with-ffmpeg.html" rel="nofollow">https:&#x2F;&#x2F;blog.pkh.me&#x2F;p&#x2F;21-high-quality-gif-with-ffmpeg.html</a>
    • dceddia6 hours ago
      In many cases today “gif” is a misnomer anyway and mp4 is a better choice. Not always, not everywhere supports actual video.<p>But one case I see often: If you’re making a website with an animated gif that’s actually a .gif file, try it as an mp4 - smaller, smoother, proper colors, can still autoplay fine.
    • CrossVR14 hours ago
      I&#x27;ve been thinking of integrating pngquant as an ffmpeg filter, it would make it possible to generate even better pallettes. That would get ffmpeg on par with gifski.
    • dspillett11 hours ago
      Does ffmpeg&#x27;s gif processing support palette-per-frame yet? Last time I compared them (years ago, maybe not long after that blog post), this was a key benefit of gifski allowing it to get better results for the same filesize in many cases (not all, particularly small images, as the total size of the palette information can be significant).
    • crazysim11 hours ago
      Gifski (<a href="https:&#x2F;&#x2F;gif.ski&#x2F;" rel="nofollow">https:&#x2F;&#x2F;gif.ski&#x2F;</a>) might be a good alternative to look to that&#x27;s gif-pallete aware.
    • foltik8 hours ago
      It’s a shame this isn’t the default.
    • BoingBoomTschak7 hours ago
      I use `split[s0][s1];[s0]palettegen=max_colors=64[p];[s1][p]paletteuse=dither=bayer` personally, limiting the number of colors is a great way to transparently (to a certain point, try with different values) improve compression, as is bayer (ordered) dithering which is almost mandatory to not explode output filesizes.
    • xattt13 hours ago
      Those command flags just roll off the tongue like two old friends catching up!<p>&#x2F;s
  • phil2948 hours ago
    I like it and would like to see an entire Linux OS being done in a similar manner. Or shell &#x2F; wrapper &#x2F; whatever.<p>A sane homogeneous cli for once, that treats its user as a human instead of forcing them to remember the incompatible invocation options of `tar` and `dd` for absolutely no reason.<p><pre><code> zip my-folder into my-zip.tar with compression level 9 write my-iso .&#x2F;zip.zip onto external hard drive git delete commit 1a4db4c convert .&#x2F;video.mp4 and .&#x2F;audio.mp3 into .&#x2F;out.mp4 merge .&#x2F;video.mp4 and .&#x2F;audio.mp3 to .&#x2F;out.mp4 without re-encoding </code></pre> And add amazing autocomplete, while allowing as many wordings as possible. No need for LLMs.<p>One can dream.
    • ishandotpage12 minutes ago
      I think you may enjoy [Nushell](<a href="https:&#x2F;&#x2F;www.nushell.sh" rel="nofollow">https:&#x2F;&#x2F;www.nushell.sh</a>)
    • PaulDavisThe1st4 hours ago
      &gt; write my-iso .&#x2F;zip.zip onto external hard drive<p>Dang! not <i>that one</i>, the other one!<p>&gt; zip my-folder into my-zip.tar with compression level 9<p>What do you mean, I don&#x27;t have write permissions in the current working directory? I meant for you to put the output in $HOME, i mean &#x2F;tmp, i mean &#x2F;var&#x2F;tmp, i mean on the external hard drive, no other other one.<p>&gt; git delete commit 1a4db4c<p>What did you do? I didn&#x27;t mean delete it and erase it from the reflog and run gc! I just mean &quot;delete it&quot; the way any one would ever mean that! I can never get it back now!
      • phil2941 hour ago
        Things that definitely need interactive prompts before running or fail out of ambiguity otherwise. Let&#x27;s not pretend these are impossible problems to overcome design-wise.
    • self_awareness3 hours ago
      Why not use Windows or macOS then? You don&#x27;t need to use shells there.<p>I would prefer not to change the technical aspects of Linux. I actually cherish it.
    • dheera7 hours ago
      See my more generalized CLI helper which does exactly this:<p><a href="https:&#x2F;&#x2F;github.com&#x2F;dheera&#x2F;scripts&#x2F;blob&#x2F;master&#x2F;helpme" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;dheera&#x2F;scripts&#x2F;blob&#x2F;master&#x2F;helpme</a><p>Example usage:<p><pre><code> helpme ffmpeg assemble all the .jpg files into an .mp4 timelapse video at 8fps helpme zip my-folder into my-zip.tar with compression level 9 helpme git delete commit 1a4db4c ... </code></pre> This originated from an ffmpeg wrapper I wrote but then realized it could be used for all commands:<p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40410637">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40410637</a>
  • HelloUsername15 hours ago
    The one good usecase I&#x27;ve found for AI chatbots, is writing ffmpeg commands. You can just keep chatting with it until you have the command you need. Some of them I save as an executable .command, or in my .txt note.
    • corobo13 hours ago
      LLMs are an amazing advance in natural language parsing.<p>The problem is someone decided that and the contents of Wikipedia was all something needs to be intelligent haha
      • madeofpalk12 hours ago
        The confusion was thinking that language is the same thing as intelligence.
        • Kiro10 hours ago
          You and me are great examples of that. We are both extremely stupid and yet we can speak.
        • Marazan12 hours ago
          This seems like a glib one liner but I do think it is profoundly insightful as to how some people approach thinking about LLMs.<p>It is almost like there is hardwiring in our brains that makes us instinctively correlate language generation with intelligence and people cannot separate the two.<p>It would be like if for the first calculators ever produced instead of responding with 8 to the input 4 + 4 = printed out &quot;Great question! The answer to your question is 7.98&quot; and that resulted in a slew of people proclaiming the arrival of AGI (or, more seriously, the ELIZA Effect is a thing).
      • andrepd9 hours ago
        And reddit, that bastion of human achievement.
    • Terr_15 hours ago
      As pessimistic about it as I am, I do think LLMs have a place in helping people turn their text description into formal directives. (Search terms, command-line, SQL, etc.)<p>... <i>Provided that</i> the user sees what&#x27;s being made for them and can confirm it and (hopefully) <i>learn</i> the target &quot;language.&quot;<p>Tutor, not a do-for-you assistant.
      • left-struck14 hours ago
        I agree apart from the learning part. The thing is unless you have some very specific needs where you need to use ffmpeg a lot, there’s just no need to learn this stuff. If I have to touch it once a year I have much better things to spend my time learning than ffmpeg command
        • rolfus12 hours ago
          Agreed. I have a bunch of little command-line apps that I use 0.3 to 3 times a year* and I&#x27;m never going to memorize the commands or syntax for those. I&#x27;ll be happy to remember the names of these tools, so I can actually find them on my own computer.<p>* - Just a few days ago I used ImageMagick for the first time in at least three years. I downloaded it just to find that I already had it installed.
        • serial_dev13 hours ago
          There is no universe where I would like to spend brain power on learning ffmpeg commands by heart.
          • skydhash11 hours ago
            No one learns those. What people do is just learning the UX of the cli and the terminology (codec, opus, bitrate, sampling,…)
        • lukeschlather7 hours ago
          The thing about ffmpeg is there&#x27;s no substitute for learning. It&#x27;s pretty common that something simple like &quot;ff convert&quot; simply doesn&#x27;t work and you have to learn about resolution, color space, profiles, or container formats. An LLM can help but earlier this year I spent a lot of time looking at these sorts of edge cases, and I can easily make any LLM wildly hallucinate by asking questions about how to use ffmpeg to handle particular files.
      • famahar11 hours ago
        Do most devs even look at the source code for packages they install? Or the compiled machine code? I think of this as just a higher level of abstraction. Confirm it works and not worry about the details of how it works
        • d-us-vb11 hours ago
          For the kinds of things you’d need to reach for an LLM, there’s no way to trust that it actually generated what you actually asked for. You could ask it to write a bunch of tests, but you still need to read the tests.<p>It isn’t fair to say “since I don’t read the source of the libraries I install that are written by humans, I don’t need to read the output of an llm; it’s a higher level of abstraction” for two reasons:<p>1. Most Libraries worth using have already been proven by being used in actual projects. If you can see that a project has lots of bug fixes, you know it’s better than raw code. Most bugs don’t show up unless code gets put through its paces.<p>2. Actual humans have actual problems that they’re willing to solve to a high degree of fidelity. This is essentially saying that humans have both a massive context window and an even more massive ability to prioritize important things that are implicit. LLMs can’t prioritize like humans because they don’t have experiences.
        • skydhash11 hours ago
          I don’t because I trust the process to get the artifacts. Why? Because it’s easy to replicate and verify. Just like how proof works in math.<p>You can’t verify LLM’s output. And thus, any form of trust is faith, not rational logic.
          • ben_w10 hours ago
            I don&#x27;t install 3rd party dependencies if I can avoid them. Why? Because although someone could have verified them, there&#x27;s no guarantee that anybody actually did, and this difference has been exploited by attackers often enough to get its own name, a &quot;supply-chain attack&quot;.<p>With an LLM’s output, it is short enough that I can* put in the effort to make sure it&#x27;s not obliviously malicious. Then I save the output as an artefact.<p>* and I do put in this effort, unless I&#x27;m deliberately experimenting with vibe coding to see what the SOTA is.
            • skydhash9 hours ago
              &gt; Because although someone could have verified them, there&#x27;s no guarantee that anybody actually did<p>In the case of npm and the like, I don&#x27;t trust them because they are actually using insecure procedures, which is proven to be so. And the vectors of attacks are well known. But I do trust Debian and the binaries they provide as the risks are for the Debian infrastructure to be compromised, malicious code in in the original source, and cryptographic failures. All threes are possibles, but there&#x27;s more risk of bodily harm to myself that them happening.
      • xattt13 hours ago
        It you stretch it little further, those formal directives also include language and vocabulary of a particular domain (legalese, etc…).
      • eviks14 hours ago
        The &quot;provided&quot; isn&#x27;t provided, of course, especially the learning part, that&#x27;s not what you&#x27;d turn to AI for vs more reliable tutoring alternatives
    • Tempest198115 hours ago
      One that older AI struggled with was the &quot;bounce&quot; effect: play from 0:00 to 0:03, then backwards from 0:03 to 0:00, then repeat 5 times.
      • geysersam12 hours ago
        Just tried it and got this, is it correct?<p>&gt; Write an ffmpeg command that implements the &quot;bounce&quot; effect: play from 0:00 to 0:03, then backwards from 0:03 to 0:00, then repeat 5 times.<p><pre><code> ffmpeg -i input.mp4 \ -filter_complex &quot; [0:v]trim=0:3,setpts=PTS-STARTPTS[f]; [f]reverse[r]; [f][r]concat=n=2:v=1:a=0[b]; [b]loop=loop=4:size=150:start=0 &quot; \ output.mp4</code></pre>
        • Tempest19815 hours ago
          Thanks, but no luck. I tested it on a 3 second video, and got a 6 second video. I.e. it bounced 1 time, not 5 times.<p>Maybe this should be an AI reasoning test.<p>Here is what eventually worked, iirc (10 bounces):<p><pre><code> ffmpeg -i input.mkv -filter_complex &quot;split=2[fwd][rev_in]; [rev_in]reverse[rev]; [fwd][rev]concat=n=2,split=10[p1][p2][p3][p4][p5][p6][p7][p8][p9][p10]; [p1][p2][p3][p4][p5][p6][p7][p8][p9][p10]concat=n=10[outv]&quot; -map &quot;[outv]&quot; -an output.mkv</code></pre>
    • beepbooptheory14 hours ago
      But doesnt something like this interface kind of show the inefficiency of this? Like we can all agree ffmpeg is somewhat esoteric and LLMs are probably really great at it, but at the end of the day if you can get 90% of what you need with just some good porcelain, why waste the energy spinning up the GPU?
      • pixelpoet14 hours ago
        Requiring the installation of a massive kraken like node.js and npm to run a commandline executable hardly screams efficiency...
        • RadiozRadioz11 hours ago
          That&#x27;s a deficiency with this particular implementation, not an inherent disadvantage to the method
      • chpatrick13 hours ago
        Because FFmpeg is a swiss army knife with a million blades and I don&#x27;t think any easy interface is really going to do the job well. It&#x27;s a great LLM use case.
        • skydhash11 hours ago
          But you only need to find the correct tool once and mark it in some way. Aka write a wrapper script, jot down some notes. You are acting like you’re forced to use the cli each time.
          • NewsaHackO5 hours ago
            One can do that with LLM as well. Honestly, I almost always just save the command if I think I am going to use it later. Also, I can just look back at the chat history.
        • beepbooptheory9 hours ago
          I know everybody uses a subscription for these things, but doesn&#x27;t it at least <i>feel</i> expensive to use an LLM like this? Like turning on the oven to heat up a single slice of pizza.
          • lukeschlather7 hours ago
            No, LLMs are extremely useful for dealing with ffmpeg. Also I don&#x27;t think they&#x27;re sufficient, they get confused too easily and ffmpeg is extremely confusing.
          • ThrowawayTestr2 hours ago
            ChatGPTs free tier is just fine for me.
      • geysersam12 hours ago
        Because getting 90% might not be good enough, and the effort you need to expend to reach 97% costs much more than the energy the GPU uses.
      • imiric14 hours ago
        Because the porcelain is purpose built for a specific use case. If you need something outside of what its author intended, you&#x27;ll need to get your hands dirty.<p>And, realistically, compute and power is cheap for getting help with one-off CLI commands.
  • vithalreddy14 hours ago
    Can&#x27;t access the githup repo <a href="https:&#x2F;&#x2F;github.com&#x2F;josharsh&#x2F;ezff" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;josharsh&#x2F;ezff</a>
    • az09mugen12 hours ago
      Same here, I get a 404 from github. The said link is at the bottom of the submitted npmjs page.
    • ramigb12 hours ago
      yeah me too but npm has the code tab <a href="https:&#x2F;&#x2F;www.npmjs.com&#x2F;package&#x2F;ezff?activeTab=code" rel="nofollow">https:&#x2F;&#x2F;www.npmjs.com&#x2F;package&#x2F;ezff?activeTab=code</a>
  • karmakaze9 hours ago
    I would definitely use an LLM, to see what the suggested options do and tweak them.<p>Using a different package name could be helpful. I searched for ezff docs and found a completely different Python library. Also ez-ffmpeg turns up a Rust lib which looks great if calling from Rust.
  • tgsovlerkhgsel7 hours ago
    LLMs are a great interface for ffmpeg. Sometimes it takes 2-3 attempts&#x2F;fixes (&quot;The subtitles in the video your command generated are offset: i see the subtitles from the beginning of the movie but the video is cut from the middle of the movie as requested, fix the command&quot;) but it generally creates complex commands much more quickly than manual work (reading the man page, crafting the command, debugging it) would.
  • ramon15611 hours ago
    &gt; it handles 20 common patterns ... that cover 90%<p>Could you elaborate on this? I see a lot of AI-use and I&#x27;m wondering if this is claude speaking or you
  • btbuildem9 hours ago
    npm? Have we learned nothing from the weekly node&#x2F;npm security breaches? Not putting that hot mess anywhere near my dev box, thanks.
  • Workaccount29 hours ago
    The total upheaval of the current computing paradigm that AI will bring, if nothing else, is<p>&quot;Hey computer, can you convert that funny kitchen cooking scene in this movie to a .gif I can share online?&quot;<p>You&#x27;re wasting your time on a dead man walking paradigm doing anything else. &quot;Plain English&quot; <i>actually</i> means plain English now.
    • two_handfuls8 hours ago
      You&#x27;re not wrong, but also there is value in a tool that will behave the same way consistently and has been vetted. I wouldn&#x27;t be so down on this work.
      • andyfilms18 hours ago
        It is a bit of a catch-22, a plain english wrapper opens up the tool to be more widely used by novices, but also prevents those novices from actually learning the tool.
        • Gud7 hours ago
          Not really, how are they prevented from using the manual or the copious amounts of examples out there?<p>Memorising command line options beyond the absolute basics has rarely been helpful to me. And I use FreeBSD, where arcane commands are plentiful.
          • andyfilms14 hours ago
            Nothing, but after becoming reliant on an LLM they may simply become overwhelmed and give up once they outgrow it&#x27;s capabilities. I&#x27;ve seen this happen to several people I know.
      • Workaccount27 hours ago
        It&#x27;s not so much being down on the work, as it is being down on 30 years of keyboard junkies proclaiming &quot;Plain English&quot; interfaces.
    • christstopit8 hours ago
      [dead]
  • low_tech_punk4 hours ago
    No AI is appealing but there is the cliff problem. If there is one small thing the mini language can&#x27;t handle, the user would have no chance solving it themselves. They might as well start with an LLM solution first.<p>One workaround is that when there is syntax error, let user optionally switch to LLM?
  • eviks14 hours ago
    That&#x27;s the problem ideally solved by typed data, i.e., some UI where instead of trying to memorize whether it&#x27;s thumb&#x2F;s&#x2F;nails you can read the closed list of alternatives, read contextual help and pick one
    • my_brain_saying14 hours ago
      This is why we have fish tab completions. Does exactly that; list of possible commands with contextual help. Fish rules.
      • eviks14 hours ago
        Yeah, no, that&#x27;s a pale imitation that only addresses the one specific example given. But, like, how would you even know what target formats are supported? Break the flow and look it up or simply read the drop-down list? The free type-any-text interface with poor helpers is the worst in accessibility<p>Which format is the default if no argument is given?<p>Or more complicated contextual knowledge - if you cut 1sec of a video file, does fish autocomplete to tell you whether the video is reencoded or cut (otherwise) losslessly<p>Also, what does fish complete to on Windows?
        • skydhash11 hours ago
          Which flow is being broken here? Especially when the information is easily accessible with `man`.
          • eviks10 hours ago
            the flow that doesn&#x27;t require you to open a different tab or cancel a command to `man` your way through dozens of poorly searchable pages of documentation, but allows you to continue translating what you want in your mind into the interface command with delay potentially subsecond interrupts
            • skydhash10 hours ago
              Is there kind of rewards for speed running typing ffmpeg flags? Like an advent of ffmpeg?<p>I know what I want to do, I don&#x27;t know how it&#x27;s being done, but there&#x27;s a wealth of information that is very accessible. So I just read it.<p>It&#x27;s very easy to type `apropos ffmpeg`. And even if you typed `man ffmpeg`, if you go to the end, you will find related manuals name for more information. And you can always use the pager (`less` in most case) facility for quick search.<p>I believe that a lot of frustration comes from people unwilling to learn the conceptual basis of the tools they are using.
              • eviks10 hours ago
                What&#x27;s the reward for trivializing real issues and coming up with broken &quot;solutions&quot;?<p>&gt; It&#x27;s very easy to type `apropos ffmpeg`<p>No it&#x27;s not. First, that&#x27;s not a Windows command, so right off the bat you&#x27;ve cut off the largest OS. Second, your command is naively empty and it&#x27;s telling that you&#x27;ve given it instead of an actual search query because you wouldn&#x27;t be able to come up with a great one right away that would result in the correct result at the top - while the correct resuls is &quot;hardcoded&quot; in the field type in the UI. So yeah, go on, find that perfect query and then explain why you think every single user should be able to do the same quickly. Then you can think about how justified your other beliefs are about basic workflow issues you don&#x27;t understand
                • skydhash9 hours ago
                  &gt; What&#x27;s the reward for trivializing real issues and coming up with broken &quot;solutions&quot;<p>Then any solutions is broken in this way. Even my bluetooth speaker comes with a manual. Not reading it and saying the speaker is broken, because you can&#x27;t figure how to connect is pure delusion. Same as not reading ffmpeg manual and expecting to know how to use it.<p>&gt; First, that&#x27;s not a Windows command, so right off the bat you&#x27;ve cut off the largest OS.<p>ffmpeg on Window is so far the beaten path that it may as well be in Mordor. I would gladly bet that someone that knows how to run ffmpeg on windows also knows how to find the documentation for it.<p>&gt; So yeah, go on, find that perfect query<p>Why would I find the perfect query? Do you go in the library and then find the correct line of the correct book in one go? Or do you consult the list of books of books for a theme, select a few candidates, consult their index, and then read the pages?<p>Then all of that is left to do is to note down the reference if you need to consult the book again (no need to remember everything).
                  • eviks8 hours ago
                    &gt; Then any solutions is broken in this way.<p>Nope, you&#x27;re just doing the same thing - purposefully ignoring the issue to make your non-solution comparable...<p>&gt; Even my bluetooth speaker comes with a manual.<p>... in this case - the length and scope of the manual. First, you can operate the speaker without the manual or with just a single read of the manual- so spend a few seconds to learn how to pair (but you might not even need that as &quot;hold to pair&quot; might be something you remember from other devices), then the power&#x2F;volume buttons require no manual because you&#x27;ve operated such buttons your whole life.<p>&gt; Same as not reading ffmpeg manual<p>Of course it&#x27;s not the same, the ffmpeg manual isn&#x27;t a tiny page of 5 items, and no other apps will help you learn the peculiarities of ffmpeg. Also, the whole point of intuitive UI with &quot;typed info&quot; is that you don&#x27;t need to read that huge manual to do the basics as you can simply follow the structure laid out by someone more knowledgeable<p>&gt; ffmpeg on Window is so far the beaten path that it may as well be in Mordor. I would gladly bet that someone that knows how to run ffmpeg on windows also knows how to find the documentation for it.<p>Who would take that irrelevant bet? The issue isn&#x27;t in finding! the manual!<p>&gt; Why would I find the perfect query?<p>To prove that your solution works. I know it doesn&#x27;t and challenge you to prove otherwise. Your suggestion is worse than asking users to Google, because at least there users will likely get the correct top result in a few tries for common needs<p>&gt; Do you go in the library and then find the correct line of the correct book in one go?<p>No, I open an app and pick the correct format from the drop-down menu correctly in one go<p>&gt; Or do you consult the list of books of books for a theme, select a few candidates, consult their index, and then read the pages?<p>Oh man, even in your fantasies you can&#x27;t come up with a good workflow! No wonder you&#x27;re fine suggesting everyone wastes a lot of time aproposing empty queries
                    • skydhash7 hours ago
                      If you take the set of possible ffmpeg invocations, it&#x27;s very huge. Yes, it&#x27;s possible to create some kind of wrapper that serve some common cases. And there are many of such wrappers or alternative tools like Xld (macOS) or Handbrake. But when you do need to use ffmpeg, that means that such wrapper is unfit for some reason or another. And in that case, it&#x27;s not that much of an effort to read the manual which is very comprehensive.<p>It&#x27;s the same with video viewers or music players. Often the default app of the OS is enough and they are very intuitive. But sometimes you need a bit more control and that&#x27;s when using something like vlc or mpv which their extensive filter capabilities (which requires to have the doc at hand) is mandatory.<p>ffmpeg interface is ok for what it does. Any of your suggestion would be complex to implement if it aims to support the whole feature set of ffmpeg.
          • NooneAtAll39 hours ago
            &quot;why would one solve the problem with one drop-down menu if you can solve it with 20 minutes of browsing walls of text&quot;<p>lol
  • petterroea14 hours ago
    Somehow it seems ffmpeg has become the &quot;Can it run crysis&quot; of UX design
  • alexellisuk11 hours ago
    This looks handy.. along with the odd gist of &quot;convert mkv to mp4&quot; that I have to use every other week.<p>Quite telling that these tools need to exist to make ffmpeg actually usable by humans (including very experienced developers).
    • teitoklien11 hours ago
      i figure out the niche ffmpeg commands various chain filters, etc then expose them from my python cli tool with words similar to what this gentleman above has done.<p>If one has fewer such commands its as simple as just bash aliases and just adding it to ~&#x2F;.bashrc<p>alias convertmkvtomp4=&#x27;ffmpeg command&#x27;<p>then just run it anytime with just that alias phrase i use ffmpeg a lot so i have my own dedicated cli snippet tool for me, to quickly build out complex pipeline in easier language<p>the best part is i have --dry-run then exposes the flow + explicit commands being used at each step, if i need details on whats happening and verbose output at each step
    • sallveburrpi11 hours ago
      I have a text file with some common commands, so no tools needed.<p>But yea ffmpeg is awesome software, one of the great oss projects imo. working with video is hellish and it makes it possible.
  • gcanyon10 hours ago
    I can only speak to my experience, but I spent a long time being puzzled by video editor user interfaces, until I ran into ScreenFlow about ten years ago. For whatever reason, the UI clicked, and I&#x27;ve used it ever since. It&#x27;s a single purchase, not monthly, and relatively affordable. <a href="https:&#x2F;&#x2F;www.telestream.net&#x2F;screenflow&#x2F;overview.htm" rel="nofollow">https:&#x2F;&#x2F;www.telestream.net&#x2F;screenflow&#x2F;overview.htm</a>
  • broken-kebab13 hours ago
    I like the idea, but a CLI utility dependent on Node.js is not a good thing frankly.
    • AnonC10 hours ago
      I agree. Apart from having to use npm (and its package repository being susceptible to security issues), I’d prefer something a lot simpler. Could’ve been a Rust program or a Go program (a single executable) that could be built locally or installed (using several different methods and offering a choice).
    • tclancy13 hours ago
      That ship sailed some time ago.
  • arjie8 hours ago
    I actually just use Claude code. “Stabilize the video x.mp4 and keep my daughter Astra as the subject. Convert it to a GIF that is under a megabyte”. It does a great job.<p>It will sample images from the video then go crop the video to that, stabilize if required, and then make me an optimized GIF that I can put in my weekly journal.
  • mmahemoff15 hours ago
    Very cool idea since ffmpeg is one of those tools that has a few common tasks but most users would need to look up the syntax every time to implement them (or make an alias). In line with the ease of use motivation, you might consider supporting tab completion.
  • sirodoht7 hours ago
    Inspiring! I just asked Cursor to make llmwrap inspired by this, it&#x27;s like rlwrap (readline wrap) but with LLMs!<p><a href="https:&#x2F;&#x2F;github.com&#x2F;sirodoht&#x2F;llmwrap" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;sirodoht&#x2F;llmwrap</a>
  • Kwpolska15 hours ago
    GitHub repo link returns 404.
  • spullara14 hours ago
    I have a little script that I use on the CLI to do this kind of stuff (calls an LLM to figure out how to do CLI stuff) but you can just as easily now use any of the coding agents.
  • ramigb12 hours ago
    That&#x27;s beautiful! I see a .claude folder in your code, I am curious if you&#x27;ve &quot;vibecoded&quot; the whole project or just had claude there for some tasks! not that it matters or takes away from your work but just pure curiosity as someone who enjoys betting on the LLM output XD
  • nextstep1 hour ago
    This is very nice. When I use ffmpeg recently I usually ask an LLM first but it often takes a few tries to get the exact incantation right.<p>On a side note (I’m not a web developer), why would a command line tool like this be written and distributed using node.js? Seems like an unnecessary risk to use JavaScript for a basic (local) command line tool. Couldn’t this be written more simply in like Rust or something?
  • naikrovek2 hours ago
    Small English nitpick:<p>&gt; ff slow down video.mp4 by 2x<p>How do you slow something down by 2x? x is a multiplier. 2 is a number greater than 1. Multiplying by a number greater than 1 makes the result LARGER.<p>If you’re talking about “stretch movie duration to 2x”, <i>say that instead</i>.<p>Saying something is 2x smaller or 2x shorter or 2x cheaper doesn’t make sense. 2x what? What is the unit of “cheap” or “short” or “small”?<p>How much is “1 slow down”? How many “slow down” are in the movie where you want twice as many of them? Doesn’t make sense does it? So how can something be slowed by 2x? That also doesn’t make sense.<p>I know what is trying to be said. I know what is meant. Please just say it right. Things like throw us autistic people for a freaking loop, man. This really rustles our jimmies.<p>Language is for communicating. If we aren’t all on the same page about how to say stuff, you spend time typing and talking and writing and reading and your message doesn’t make it across the interpersonal language barrier.<p>I don’t want to see people wasting their time trying to communicate good ideas with bad phrasing. I want people to be able to say what they mean and move on.<p>I also don’t want to nitpick things like this, but I don’t want phrases like “slow down by 2x” to be considered normal English, either, because they aren’t.
    • nulltype27 minutes ago
      Isn’t it somewhat common to say something like “slow this down by a factor of 2”?
    • jimbobthrowawy2 hours ago
      Reminds me of a thing Steve Mould mentioned in a video about a claim in a book &quot;The temperature outside an aeroplane is six times colder than the temperature inside a freezer.&quot;<p><a href="https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=C91gKuxutTU" rel="nofollow">https:&#x2F;&#x2F;www.youtube.com&#x2F;watch?v=C91gKuxutTU</a> - Stand-up comedy routine about bad science
  • Tempest198115 hours ago
    I was surprised that macOS (QuickTime&#x2F;Preview, iMovie) can&#x27;t read .mp4 files. Not sure if it was due to H.265 or the audio codec. I tried using ffmpeg to convert to .mov but that also failed to open, since I guess MOV is just another container format.<p>Is there an easier way?
    • kiicia14 hours ago
      MP4 is container, not format, so if you have unsupported format packed into MP4 container it won’t be played. Example is trying to play AV1 video codec on devices with M2 chip or older. It won’t play. But it will play on devices with M3 chip and newer. Easiest solution is to use other player so that you can watch any MP4 file but with software decoding where hardware decoding is not available. Examples of such players are MPV or VLC.
      • Tempest19819 hours ago
        Yes, VLC works fine for playing. The user wanted to edit some mp4 videos with iMovie (vs ffmpeg).<p>I think it was an M4 Mac. Does iMovie need a codec pack? I know some PC OEMs don&#x27;t ship an h.265 codec, pointing users to a $0.99 download. Thought Mac would include it, being aimed at content creators. Hoping for a cheaper solution than Adobe Premiere.
    • felixfoertsch15 hours ago
      IMHO the de-facto video player for macOS is [IINA](<a href="https:&#x2F;&#x2F;iina.io&#x2F;" rel="nofollow">https:&#x2F;&#x2F;iina.io&#x2F;</a>).
      • trvz14 hours ago
        That exists, but it’s still VLC.
        • wging14 hours ago
          It&#x27;s based on mpv, not vlc.
    • andrewf11 hours ago
      Try something like: ffmpeg -i in.mp4 -c:v h264 -c:a aac out.mp4<p>To re-encode the content into H.264+AAC, rather than simply &quot;muxing&quot; the encoded bitstreams from the MP4 container into a new MOV container.
      • Tempest19818 hours ago
        Thanks, I can even somewhat remember that. AI gave me args like<p><pre><code> -c:v libx264 -pix_fmt yuv420p -preset medium -crf 18 \ -c:a aac -b:a 192k \</code></pre>
        • stackedinserter8 hours ago
          &quot;-c:v h264_videotoolbox -b:v 5000k&quot; on macos, it will use hardware encoder.
    • codegladiator15 hours ago
      vlc
  • vivzkestrel11 hours ago
    I would love to see something like this for OpenSSL
  • pdyc14 hours ago
    interesting approach, i solved similar problem by creating visual tool to generate ffmpeg commands but its not the same(it cant do conversion etc.)<p>I like that you took no AI approach, i am looking for something like this i.e. understanding intent and generating command without using AI but so far regex based approaches have proved to be inadequate. I also tried indexing keywords and creating index of keywords with similar meaning that improved the situation a bit but without something heavy like bert its always a subpar experience.
  • gamer19112 hours ago
    Thanks, will definitely check this out<p>Has anyone else been avoiding typing FFmpeg commands by using file:&#x2F;&#x2F; URLs with yt-dlp
  • bdbdbdb14 hours ago
    Sometimes an idea comes along thats so obvious it makes me angry. I have been struggling with ffmpeg commands for over well a decade. All the time I wasted googling and creating scripts so I wouldn&#x27;t have to regoogle and this could have existed literally from day one
  • justinhj6 hours ago
    There is no need for a wrapper or memorizing syntax in our new llm world.
  • lerp-io1 hour ago
    claude cli for ffmpeg is op lol
  • Joyfield12 hours ago
    Uhm... Millibit, Millibyte, Megabit, Megabyte?
    • two_handfuls7 hours ago
      Good point, &quot;mb&quot; as used in the linked example would mean &quot;millibit&quot;, which is almost certainly not what they meant.
  • dheera7 hours ago
    See also:<p><a href="https:&#x2F;&#x2F;github.com&#x2F;dheera&#x2F;scripts&#x2F;blob&#x2F;master&#x2F;helpme" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;dheera&#x2F;scripts&#x2F;blob&#x2F;master&#x2F;helpme</a><p><pre><code> helpme ffmpeg assemble all the .jpg files into an .mp4 timelapse video at 8fps </code></pre> This evolved from an ffmpeg wrapper I wrote before:<p><a href="https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40410637">https:&#x2F;&#x2F;news.ycombinator.com&#x2F;item?id=40410637</a>
  • maximgeorge13 hours ago
    [dead]
  • Kcnfjhggjbh10 hours ago
    [flagged]