1/14/2025 at 9:37:28 PM
I've enjoyed using ffmpeg 1000% more since I was able to stop doing manually the tedious task of Googling for Stack Overflow answers and cobbling them into a command and got Chat GPT to write me commands instead.by hbn
1/14/2025 at 11:53:48 PM
I use ffmpeg multiple times a week thanks to LLMs. It's my top use-case for my "llm cmd" tool: uv tool install llm
llm install llm-cmd
llm cmd use ffmpeg to extract audio from myfile.mov and save that as mp3
https://github.com/simonw/llm-cmd
by simonw
1/15/2025 at 12:31:49 AM
I tried this (though with a different tool called aichat) for extremely simple stuff like just "convert this mov to mp4" and it generated overly complex commands that failed due to missing libraries. When I removed the "crap" from the commands, they worked.So much like code assistance, they still need a fair amount of baby sitting. A good boost for experienced operators but might suck for beginners.
by resonious
1/15/2025 at 7:57:23 AM
Plus you need to know the format of your source file to design the command correctly. How many audio tracks, is the first video track a thumbnail or the video, are the subtitles tracks forced, etc.And in some situations ffmpeg has some warts you have to go around. Like they introduced recently a moronic change of behaviour where the first sub tracks becomes forced/default irrespective of the original forced/default flag of the source. You need to add "-default_mode infer_no_subs" to counter that.
by cm2187
1/15/2025 at 3:19:39 PM
I usually just paste the output of `ffprobe` into Claude when it's ambiguous. Works a treat.by Philpax
1/15/2025 at 3:31:37 AM
My feelings exactly, but I think that's OK!It's another tool and one that might actually improve with time. I don't see GNU's man pages getting any better spontaneously.
Whoa, what if they started to use AI to auto-generate man pages...
by Over2Chars
1/15/2025 at 9:34:08 AM
> Whoa, what if they started to use AI to auto-generate man pages...That’s the time to start my career in woodworking.
by BlaDeKke
1/15/2025 at 2:03:36 PM
I already generate man pages (and POD) with Claude for my new projects. :DIt works really well.
by johnisgood
1/15/2025 at 11:47:14 PM
Any links you wanna share? I've never seen an AI generated man page.by Over2Chars
1/16/2025 at 5:47:11 PM
They are not public projects, but man pages and their README.md is generated albeit refined through prompts.It is not simply a "generate man page" without context.
by johnisgood
1/16/2025 at 1:01:18 PM
> what if they started to use AI to auto-generate man pages...Then they'd be wrong about 20% of the time, and still no one would read them. ;-)
(NB: I'm of the age that I do read them, but I think I'm in the minority.)
by michaelcampbell
1/15/2025 at 7:49:14 AM
Reading this feels like seing a guy getting his first car in 1920 and complaining he still has to drive it himself.by BiteCode_dev
1/15/2025 at 2:15:59 PM
To me it's more like a guy getting his first car and complaining that the car is driving him in a direction that may or may not be correct, despite his best efforts to steer it where he wants to go. And the only way to know whether he ends up in the right place is to get out of the car, look around, and maybe ask more experienced drivers. Failing that, his only option is to get back in and hope to be luckier in the next trip.Or he can just ditch the car and walk. Sure, it's slower and requires more effort, but he knows exactly how to do that and where it will take him.
by imiric
1/15/2025 at 9:57:24 AM
The beer brewers in my home town used to have a self-driving horse and cart which knew the daily delivery route going by all pubs and didn't really need a human to steer it or indeed be conscious during the trip. Expectedly, the delivery guy would get drunk first thing in the morning and just get carted about collecting the money.by LeoWattenberg
1/15/2025 at 8:07:00 AM
Pony & trap could be largely self-driving, after an initial training period. That would have been a distinct negative to "upgrading" for some, I'd imagine.by pbhjpbhj
1/15/2025 at 2:19:56 PM
It's speed and load capacity vs self-driving.If we could imagine wiring a pony to control a car, its brain, while good at navigation, would likely be inadequate at the speed that a car attains.
by nine_k
1/15/2025 at 7:53:02 AM
Sell that guy probably got carried home by his horse after drinking half a bottle of whiskey, so maybe he had a point.by atoav
1/15/2025 at 3:28:15 PM
Or maybe calling a cab and telling the cab driver each direction to get to the destination instead of the cab driver just taking you there.by shriek
1/15/2025 at 11:40:49 AM
My experience exactly.I no longer check with these AI tools after a number of attempts. Unrelated, a friend thought there was a NFL football game last Saturday at noon. Checking with Google's Gemini, it said "no", but there was one between two teams whose season had ended two weeks before at 1:00 Eastern Time and 2:00 Central. (The times are backwards.)
by assimpleaspossi
1/15/2025 at 1:51:09 PM
Do LLMs have knowledge of current events?by bityard
1/16/2025 at 1:24:22 AM
Meta.ai got it right. The free chatGPT only has data up till 2021 or something like that.by assimpleaspossi
1/17/2025 at 9:49:15 AM
> Do LLMs have knowledge of current events?I don't think the notion of "current" has been explained to them. Thay just define it out of context.
by hulitu
1/15/2025 at 2:02:58 PM
I mean, some are capable of searching the web.Ask them about the fire in LA in 2025 January.
by johnisgood
1/15/2025 at 2:29:47 AM
> "convert this mov to mp4"Did any of the commands look like the ones in the left window:
https://beta.gitsense.com/?chats=12850fe4-ffb1-4618-9215-c13...
The left window contains a summary of all the LLMs asked, including all commands. The right window contains the individual LLM responses.
I asked about gotchas with missing libraries as well, and Sonnet 3.5 said there were. Were these the same libraries that were missing for you?
by sdesol
1/15/2025 at 2:40:53 AM
Looking at this, I am pretty sure I also received a "libx264" clause. Removing it made the command work for me.by resonious
1/15/2025 at 3:23:07 AM
I don't disagree that we need to be cautious with LLMs, but I've personally stopped asking GPT-4/GPT-4 mini for technical answers. Sonnet 3.5 and DeepSeek V3 (which is much cheaper but still not as good as Sonnet) are your best bet for technical questions.Where I find GPT to perform better than Sonnet is with text processing. GPT seems to better understand what I want when it comes to processing documents.
I'm convinced that no LLM provider has created or will create a moat, and that we will always need to shop around for an answer.
by sdesol
1/15/2025 at 5:10:29 AM
everyone stopped using 4/4mini because theyre old.4o replaced 4 back in April 2024. 01/01mini replaced 4o in Fall 2024.
stop using 4. use 01mini always. its cheaper, faster, and better.
o1/o1mini will be replaced by o3/o3mini in a couple months.
by wegfawefgawefg
1/15/2025 at 5:43:42 AM
Unfortunately you need to be tier 2 to use o1-mini. The only time I really use GPT is to summarize documents and for that, GPT-4o mini works well enough and it is significantly cheaper than other high quality models, so I never really rack up an OpenAI bill.by sdesol
1/15/2025 at 12:26:34 PM
o1 is such a joke, worse than 4o in some ways like multiturn,The months old sonnet feels a generation ahead of any OAI product I've used, I'll believe the hype on o3 when I see it, remember the sora and voice roll out?
by mistercheph
1/15/2025 at 4:47:31 PM
You may want to reconsider this position.I had this bizarre bug in rust networking code where packets were getting dropped.
i dumped all 20k lines into o1pro. it thought for about ten minutes and came back telling me that my packets had a chance of being merged if set in quick succession and i needed to send the length before each message and scan packets in a loop for subdivisions on the client. this bug hadnt happened before, only when running locally on a newer faster machine, and was frequent but hard to replicate.
it was correct, and provided detailed pseudo code to solve it.
the second case involved some front end code where during an auth flow ios would force refresh on returning to the browser causing authentication state to be lost. o1pro thought for about 5 minutes before telling me ios has a heuristic with which it decides to close an app on context switch based on available ram, etc, and that i needed to conditionally check for ios and store partial state in local store on leave assuming the app could be deloaded without my control.
it was correct. with some more back and forth we fixed the bug.
these are not the kinds of problems that claude and gpt<4 have been able to help with at all.
I also used voice, and video voice extensively for translation tasks in korea, japan, and taiwan, and for controlling japanese interfaces and forms for tax documents and software.
These are very good tools.
by wegfawefgawefg
1/15/2025 at 3:22:27 PM
o1 is not a general-purpose model, and it's not very good at multi-turn; it should instead be given all the context upfront: https://www.latent.space/p/o1-skill-issueby Philpax
1/15/2025 at 4:26:02 AM
libx264 is the best encoder for h264 ffmpeg has to offer so it's pretty important you bundle it in your ffmpeg install. Those commands are perfectly standard, I've been using something like that for 10+ yearsby jack_pp
1/15/2025 at 8:28:59 PM
what exactly do you want the llm to do here? if the ask was so unambiguous and simple that it could be reliably generated, then the interface wouldn't be so complicated to use in the first place! LLMs are not in any way best suited for one-shot prompt => perfect output, and expectations to that effect are extremely unreasonable. the reason why LLMs are still hard for beginners to use is because the software is hard to use correctly. as with LLM output goes life itself: the results you get from using a tool can only ever be as good as the (mental) model used to choose that tool & the inputs to begin with. if all the information required to generate the output were contained by the initial prompt, then there would be absolutely no need to use the LLM at all in the first place.by keeganpoppen
1/15/2025 at 1:11:19 AM
Hate to be that guy, but which LLM was doing the generation? GPT-4 Turbo / Claude 3.x have not really let me down in generating ffmpeg commands - especially for basic requests - with most of their failures resulting from domain-specific vagaries that an expert would need to weigh in on mby Philpax
1/15/2025 at 2:39:17 AM
GPT-4by resonious
1/15/2025 at 3:23:32 AM
Fair enough. If you remember what you were testing with, I'd love to try it again to see if things are better now.by Philpax
1/15/2025 at 3:34:27 AM
You have a fair point. Some LLMs are better at some tasks, and prompts can make a difference no doubt.Perhaps at some point there will be a triage LLM to slurp up the problem and then decide which secondary LLM is most optimal for that query, and some tertiary LLMs that execute and evaluate it in a virtual machine, etc.
Maybe someday
by Over2Chars
1/15/2025 at 4:02:35 AM
Oh I talked to some guys who started a company that does that. This was at an AI meetup in SF last year. They were mainly focused on making $/token cheaper by directing easy/dumb queries to smaller dumber models, but it also increases output quality because some models are just better at certain things. I'm sure all the big companies already have implementations of this by now even if they don't use it everywhereby NavinF
1/17/2025 at 7:24:46 PM
Yes they are called routers. One is https://withmartian.com/by jazzyjackson
1/15/2025 at 4:47:01 AM
I was suggesting optimizing for answer quality, but optimizing for cost might be useful too I suppose for "business innovation" purposes.by Over2Chars
1/15/2025 at 1:52:02 AM
Hate to be that guy, but which model works without fail for any task that ffmpeg can do?by th0ma5
1/15/2025 at 2:04:11 AM
"Writing working commands first try for every single ffmpeg feature that exists" is the highest bar I've ever heard of, I love it. I'm gonna start listing it as a requirement on job postings. Like an ffmpeg speedrun.by iameli
1/15/2025 at 6:59:25 PM
Yes and every failure of a product turns into a support ticket.by th0ma5
1/15/2025 at 8:48:00 AM
Obligatory xkcd: https://xkcd.com/1168/.by MzxgckZtNqX5i
1/15/2025 at 9:11:20 AM
To be fair `tar` is quite easy to use once you understand the grammar of the options.by hk__2
1/15/2025 at 2:09:27 AM
I don't think there's a single human on or outside of this planet that can meet that requirement, but Claude has been pretty good to me. It's certainly a much better starting point than pouring over docs and SO posts.by Philpax
1/15/2025 at 7:00:00 PM
In my experience you still get a lot of stuff that used to work or stuff that it just makes up.by th0ma5
1/15/2025 at 8:02:50 AM
I know I struggled on getting a good command to “simply” make the videos from my Z8 smaller (in file size).Usually the color was wrong and I don’t care enough to learn about colorspaces to figure out how to fix it and it’s utterly insane how difficult it is even with LLMs.
Just reencode it as is but a little more lossy. Is that so hard?
by AuryGlenz
1/15/2025 at 10:44:22 AM
Handbrake may be a better option for you. I find that for some tasks it’s not only simpler but straight up works better than FFmpeg.by latexr
1/15/2025 at 2:17:43 AM
This doesnt exist in reality so in one sense, you could challenge the relevanceby bloqs
1/15/2025 at 7:01:15 PM
I think in the non LLM world though you at least have the trail of documentation you can unwind once you're in a bind. I don't care for prompt-a-mole fighting.by th0ma5
1/15/2025 at 10:34:22 AM
[dead]by Melomomololo
1/15/2025 at 6:21:24 PM
A while back I simply wrote my own bash function for this called `please`as in
bash> please "use ffmpeg to extract audio from myfile.mov and save it as mp3"
It will then courteously show you the command it wants to run before you agree to do it.Here is the whole thing, with its two dependent functions, so that people stop writing their own versions of this lol. All it needs is an OPENAI_API_KEY, feel free to modify for other LLMs
EDIT: Moved to a gist: https://gist.github.com/pmarreck/9ce17f7996347dd532f3e20a2a3...
Suggestions welcome- for example I want to add a feature that either just copies it (for further modification) or prepopulates the command line with it somehow (possibly for further modification, or even for skipping the approval step)
by pmarreck
1/15/2025 at 9:56:37 PM
please is such an appropriate name. Will rename my ChatGPT alias to please.by smusamashah
1/15/2025 at 7:51:33 AM
Did you just invent the LLM-equivalent of curl-piping unread shell scripts into sh?I am sure that will never cause any problems.
by atoav
1/15/2025 at 11:37:34 AM
It displays the generated command to you, there's an additional step to confirm.by bspammer
1/15/2025 at 1:58:24 PM
Ah good to clarify, thanksby atoav
1/15/2025 at 8:56:09 AM
> Did you just invent the LLM-equivalent of curl-piping unread shell scripts into sh?Many such cases.
by ykonstant
1/15/2025 at 12:01:18 AM
"The future is already here. It's just not very well distributed"(honestly, the work you share is very inspiring)
by dekhn
1/15/2025 at 12:46:03 AM
>This will then be displayed in your terminal ready for you to edit it, or hit <enter> to execute the prompt. If the command doesnt't look right, hit Ctrl+C to cancel.I appreciate the UI choice here. I have yet to do anything with AI (consciously and deliberately, anyway) but this sort of thing is exactly what I imagine as a proper use case.
by zahlman
1/15/2025 at 1:12:14 AM
Just like all other code. There will be user-respecting open source code and tools, and there's user-disrespecting profitable closed code that makes too many decisions for you.by hnuser123456
1/15/2025 at 8:16:21 AM
Hypertalk <https://en.wikipedia.org/wiki/HyperTalk> lives.by mvonballmo
1/15/2025 at 12:00:37 PM
uv?by Beijinger
1/15/2025 at 12:23:55 PM
like pip but written in rustby phrotoma
1/15/2025 at 1:51:18 AM
You should figure out what went wrong for the other commenter and fix your tool.by th0ma5
1/15/2025 at 9:35:23 AM
While I love that that works, I still feel like just maybe ffmpeg needs a better interface. Not necessarily a GUI, just a better designed command line.by mrweasel
1/15/2025 at 1:59:19 AM
I think I’m finally sold on actually attempting to add some LLM to my toolbelt.As a helper and not a replacement, this sounds grand. Like the most epic autocomplete. Because I hate how much time I waste trying to figure out the command line incantation when I already know precisely what I want to do. It’s the weakest part of the command line experience.
by Waterluvian
1/15/2025 at 3:41:59 AM
But possibly the most rewarding. The struggle is its own reward that pays off later many times over.by Over2Chars
1/15/2025 at 1:23:56 PM
There are times I feel minor guilt for using an LLM to relieve brainwork, like figuring out an algorithm. That's probably a skill I should continue practicing for my own sake.ffmpeg commands though? It's really not a practical skill outside of using ffmpeg. There's nothing really rewarding to me about memorizing awkwardly designed CLI incantations. It's all arbitrary.
by hbn
1/16/2025 at 12:28:44 AM
memorization is not what I'm talking about.I'm talking about
1. you have a problem, you try something and it doesn't work. 2. you find an LLM and it "gives" you the answer with one or two tries 3. problem solved! what have you learned? How to have answers given to you when you ask.
or 2. you look for an answer in a dizzying haze of man pages, quacks, and website Q&As. 3. you try and try again and eventually problem solved. you have learned not only how to solve a particular problem but your overall ability to solve similar problems has done up. You've learned how to fish, not just ask an LLM for a fish.
by Over2Chars
1/15/2025 at 5:05:00 AM
Not for me. It’s a tool I don’t care to use any more than I have to. I’m much more interested in what I’m using the tool to accomplish.by Waterluvian
1/15/2025 at 6:20:21 AM
I am not talking about the tool per se. I am talking about the skill of persistence and creativity in the face of a problem.Learning a tool is useful, even invaluable, but if you don't have the persistence to use it, it's useless.
And many tools are just partially useful under some conditions. So creativity in using them is also useful.
So it's not about the tools, its about not giving up and trying different things, which makes all tools more effective, and problem-solving more likely.
by Over2Chars
1/15/2025 at 3:23:27 PM
Yeah, and I'll do that with all of the tools and problems I actually care about :pby Philpax
1/15/2025 at 11:46:35 PM
There's something to be said for that approach.It's not the only approach, and definitely as someone once said, premature optimization is the root of all evil, and maybe you are optimizing at exactly the right time.
Who can tell?
by Over2Chars
1/14/2025 at 9:58:55 PM
For the longest time I had ffmpeg in the same bucket as regex: "God I really need to learn this but I'm going to hate it so much." Then ChatGPT came along and solved both problems!by levocardia
1/15/2025 at 1:24:19 AM
Interesting. Being able to use regexps for text processing through my career has probably saved me a few thousand hours of programming one-off solutions so far. It is one of those skills that really pays off to learn proper.And speaking of ffmpeg, or tooling in general, I tend to make notes. After a while you end up with a pretty decent curated reference.
by zxvkhkxvdvbdxz
1/15/2025 at 2:32:45 AM
I use regexes a lot. The main thing that always trips me up is dealing with escaping, because different tools I use – vim, sed, rg, and so on – sometimes have different meanings for when to escape or not.In one tool you’ll use + to match one or more times, and \+ to mean literal plus sign.
In another tool you’ll use \+ to match one or more time, and + to mean literal plus sign.
In one tool you’ll use ( and ) to create a match group, and \( and \) to mean literal open and close parentheses.
In another tool you’ll use \( and \) to create a match group, and ( and ) to mean literal open and close parentheses.
This is basically the only problem I have when writing regexes, for the kinds of regexes I write.
Also, one thing that’s not a problem per se but something that leads me to write my regexes with more characters than strictly necessary is that I rarely use shorthand for groups of characters. For example the tool might have a shorthand for digit but I always write [0-9] when I need to match a digit. Also probably because the shorthand might or might not be different for different tools.
Regexes are also known to be “write once read never”, in that writing a regex is relatively easy, but revisiting a semi-complicated regex you or someone else wrote in the past takes a little bit of extra effort to figure out what it’s matching and what edits one should make to it. In this case, tools like https://regex101.com/ or https://www.debuggex.com/ help a lot.
by codetrotter
1/15/2025 at 9:02:57 AM
The problem with escaping (like with using quotes) is often that you need to know through how many parsers the string goes. The shell or editor, the language you are programming in and the regexp engine each time can strip off an escape character or a set of outer quotes. That and of course different dialects of regexp makes things complicated.by nuancebydefault
1/16/2025 at 8:58:44 AM
This is like saying that words have different meaning if you talk to someone in english or french. Most tools have a switch to inform them that you're going to talk in english (perl-compatible regular expressions).by account42
1/15/2025 at 3:15:59 AM
No one doubts the power or utility of regexes or ffmpeg, but they are both complicated beasts that really take a lot of skill.They're both tools where if they're part of your daily workflow you'll get immense value out of learning them thoroughly. If instead you need a regex once or twice a week, the benefit is not greater than the cost of learning to do it myself. I have a hundred other equally complicated things to learn and remember, half the job of the computer is to know things I can't put in my brain. If it can do the regex for me, I suddenly get 70% of the value at no cost.
Regex is not a tool I need often enough to justify the hours and brain space. But it is still an indespensible tool. So when I need a regex, I either ask a human wizard I know, or now I ask my computer directly.
by mystified5016
1/15/2025 at 7:06:03 AM
It's self-reinforcing though. If you invest the time to learn, then you may find yourself (i a beafutiful house :)) using it a lot more than two times a week.by imp0cat
1/15/2025 at 9:09:49 AM
I think this way of thinking is an uphill battle.My kid uses wifi, google classroom tools, youtube, games,... I can tell him if only you knew command.com, ipconfig, doom.wad formats, lateg,... you could be so much more proficient. I already know this will never happen, just like I never learned x86 assembly.
The same goes for tools like LLMs, once you are used to them, your knowledge shifts.
by nuancebydefault
1/15/2025 at 2:55:30 PM
I learned a bit of assembly and it was a god send to understand linkers and how FFI works. Also make it easier to have a good model on using a debugger. Not that useful on a tactical level, but really great when faced with some errors in your compiled programs.by skydhash
1/14/2025 at 11:44:39 PM
Not sure about ffmpeg, but you should definitely try memorising regexp. Casual Search&replace that becomes possible is worth it.by earnestinger
1/15/2025 at 12:04:21 AM
in 15 years it never sticks and by the time i need it again i've forgotten it! :Dby sergiotapia
1/15/2025 at 12:41:08 AM
Don't learn the Perl influenced extensions. You just need POSIX EREs (and BREs for some older utilities) which are simple enough to keep in the head.by kevin_thibedeau
1/15/2025 at 2:04:45 AM
And with the POSIX EREs, everything except the syntax for character classes will carry over to other common Regex dialects.by ziml77
1/15/2025 at 3:43:25 AM
I'll just leave this here"The Insanity Of Linux's Regular Expressions " https://www.youtube.com/watch?v=ys7yUyyQA-Y
by Over2Chars
1/15/2025 at 9:55:15 AM
Many loosely compatible (so, not compatible if we are strict..) implementations are a problem, but awareness and testing helps.All in all, my life would be miserable if I would not have regexp available in grep/sed/editor/ide/java/python, their usefulness trump any such inconveniences.
by earnestinger
1/14/2025 at 11:33:53 PM
Gotta be honest, years of configuring automod on Reddit have honed me into a regex God.by teaearlgraycold
1/15/2025 at 8:35:08 AM
For me, it wasn’t so much learning ffmpeg, as it was understanding containers/codecs/encoders/streams/etc. Learning all of the intricacies there made ffmpeg make a lot more sense.by jmb99
1/15/2025 at 3:00:19 PM
Almost no one cares to understand the domain of the tool anymore, they only want result and expect a simplified interface that already does the unique thing they want to do, but can’t accept that a power tool can only be used with training.by skydhash
1/15/2025 at 2:59:17 AM
... Then ChatGPT came along and I had 3 problems! https://regex.info/blog/2006-09-15/247by shlomo_z
1/14/2025 at 10:15:25 PM
CSS has entered the ChatGPT.by hackingonempty
1/14/2025 at 11:58:16 PM
My rule for using LLMs is that anything that's one off is okay. Anything that's more permanent and committed to a repo needs a human review. I strongly suggest you have an understanding of the basics (at least the box model) so that you are competent at reviewing CSS code before using LLM for that.by kccqzy
1/14/2025 at 11:25:05 PM
I've been looking for a good guide on prompting LLMs for CSS.does anyone know of any?
by permo-w
1/15/2025 at 2:48:37 PM
I have no set of rules when prompting LLMs for CSS, it does seem to work more or less for me though.What are your current issues or what limitations have you ran into?
by johnisgood
1/16/2025 at 5:01:20 AM
mainly with trying to impart a particular art style.by permo-w
1/14/2025 at 10:47:51 PM
Same here, it's one of these things where AI has taken over completely and I'm just a broker that copy-pastes error traces.by juancroldan
1/15/2025 at 7:38:06 AM
In addition to the many others mentioned, here's a script I just threw together that simplifies a lot of these chained commands - llmpeg: https://github.com/jjcm/llmpegIf you have ffmpeg installed and an OpenAI env api key set, it should work out of the box.
Demo: https://image.non.io/1c7a92ef-0917-49ef-9460-6298c7a9116c.we...
by jjcm
1/14/2025 at 10:48:28 PM
My experience got even better once I learned how complex filters worked.by magarnicle
1/14/2025 at 11:12:38 PM
learning how to use splits to do multiple things all in one command is a god send. the savings of only needed to read the source and convert to baseband video once is a great savings.i started with avisynth, and it took time for my brain to switch to ffmpeg. i don't know how i could function without ffmpeg at this point
by dylan604
1/15/2025 at 9:52:09 AM
Truly, a net positive to my life. Just a few days ago I asked my AI buddy (Claude) to create a zsh script to organize my downloads folder according to the Johnny Decimal system. I’ve since modified it to move the files to a JD setup on my desktop.The sense of elation I get when I wonder aloud to my digital friend and they generate what I thought was too much to expect. Well worth the subscription.
by NetOpWibby
1/15/2025 at 4:41:44 AM
I ended up creating my own tool to generate ffmpeg commands https://newbeelearn.com/tools/videoeditor/by pdyc
1/15/2025 at 3:26:20 AM
I think you're onto something. I've had hit or miss experiences with code from LLMs but it definitely makes the searching part different.I had a problem I'd been thinking about for some time and I thought "Ill have some LLM give me an answer" and it did - it was wrong and didn't work but it got me to thinking about the problem in a slightly different way and my quacks after that got me an exact solution to this problem.
So I'm willing to give the AI more than partial credit.
by Over2Chars
1/15/2025 at 6:47:54 AM
Basic syntax for re-encoding a video file did take me some time to memorize, but isn't in fact too hard: ffmpeg <Input file(s)> <Codec(s)> <MAPping of streams> <Video Filters> output_file
- input file: -i, can be repeated for multiple input files, like so: ffmpeg -i file1.mp4 -i file2.mkv
If there is more than one input file then some mapping is needed to decide what goes out in the output file.- codec: -c:x where x is the type of codec (v: video, a: audio or s:subtitles), followed by its name, like so:
-c:v libx265
I usually never set the audio codec as the guesses made by ffmpeg, based on output file type, are always right (in my experience), but deciding the video codec is useful, and so is the subtitles codec, as not all containers (file formats) support all codecs; mkv is the most flexible for subtitles codecs.- mapping of streams: -map <input_file>:<stream_type>:<order>, like so:
-map 0:v:0 -map 1:a:1 -map 1:a:0 -map 1:s:4
Map tells ffmpeg what stream from the input files to put in the output file. The first number is the position of the input file in the command, so if we're following the same example as above, '0' would be 'file1.mp4' and '1' would be 'file2.mkv'. The parameter in the middle is the stream type (v for video, a for audio, s for subtitles). The last number is the position of the stream IN THE INPUT FILE (NOT in the output file).The position of the stream in the output file is determined by the position of the map command in the command line, so for example in the command above we are inverting the position of the audio streams (taken from 'file2.mkv'), as audio stream 1 will be in first position in the output file, and audio stream 0 (the first in the second input file) will be in second position in the output file.
This map thing is for me the most counter-intuitive because it's unusual for a CLI to be order-dependent. But, well, it is.
- video filters: -vf
Video filters can be extremely complex and I don't pretend to know how to use them by heart. But one simple video filter that I use often is 'scale', for resizing a video:
-vf scale=<width>:<height>
width and height can be exact values in pixels, or one of them can be '-1' and then ffmpeg computes it based on the current aspect ratio and the other provided value, like this for example: -vf scale=320:-1
This doesn't always work because the computed value should be an even integer; if it's not, ffmpeg will raise an error and tell you why; then you can replace the -1 with the nearest even integer (I wonder why it can't do that by itself, but apparently, it can't).And that's about it! ffmpeg options are immense, but this gets me through 90% of my video encoding needs, without looking at a manual or ask an LLM. (The only other options I use often are -ss and -t for start time and duration, to time-crop a video.)
by bambax
1/15/2025 at 8:49:19 AM
> This doesn't always work because the computed value should be an even integer; if it's not, ffmpeg will raise an error and tell you why; then you can replace the -1 with the nearest even integer (I wonder why it can't do that by itself, but apparently, it can't).It's not about integer, but some of the sizes need to be even. You can use `-vf scale=320:-2` to ensure that.
by izacus
1/15/2025 at 11:49:42 AM
It's hard for a number to be even without first being an integer, no? ;-)But thanks for '-2', didn't know about that! It's the exact default option I needed! Will be using that always from now on.
https://stackoverflow.com/questions/71092347/ffmeg-option-sc...
by bambax
1/15/2025 at 8:38:09 AM
> then you can replace the -1 with the nearest even integer (I wonder why it can't do that by itself, but apparently, it can't).Likely because the aspect ratio will no longer be the same. There will either be lost information (cropping), compression/stretching, or black bars, none of which should be default behaviour. Hence, the warning.
by jmb99
1/15/2025 at 7:30:21 AM
I would like to throw in a tool that I built into the ring: gencmd - https://gencmd.com/. There is a web version and also a CLI version.If the CLI is installed, you can do: gencmd -c ffmpeg extract first 1 minute of video
Or you can just search for the same in the browser page.
by sathishvj
1/15/2025 at 2:27:03 PM
I do it the old way: I write down the commands as a shell script, and reuse later.But really what ffmpeg is missing is an expressive language to describe its operation. Something well-structured, like what jq does for JSON.
by nine_k
1/15/2025 at 3:03:46 PM
It already does. It’s the cli flags. What you’re missing is the semantic which you can get with learning about containers, codecs, and other stuff. You don’t use grep and sed with no understanding of what a text file is.by skydhash
1/16/2025 at 12:59:37 PM
ffmpeg and jq are 2 commands I've about given up trying to "use" with any facility and am more than happy to pawn that off to one of the Gippity's; chat, claude, etc.by michaelcampbell
1/15/2025 at 2:53:44 AM
For me it was using a container of it, instead of having to install all the things FFmpeg needs on a machine.by urda
1/15/2025 at 12:02:52 PM
Why not just use Handbrake? It’s just FFMpeg but with a GUI.by archerx
1/15/2025 at 2:34:20 PM
That's fine for encoding but Handbrake doesn't let you do video streaming to my knowledge.by wildzzz
1/15/2025 at 6:05:37 AM
llm - Clippit of 202x, but for the original Pentium was enough.by skirge