7/5/2026 at 8:14:07 AM
Is there anyone feeling that Pandoc is ever increasingly bloated? I have used Lua filters a decade ago [1] and the current documentation is nothing like my memories. I'm not even sure that how much of Lua scripts remain compatible across different Pandoc versions.[1] https://github.com/mearie/mearie.github.io/blob/source/res/w...
by lifthrasiir
7/5/2026 at 11:21:01 AM
With a tagline of "a universal document converter" it is almost a guarantee to become a complicated program but how much of it is being used for any single conversion?Two more examples:
Rclone is "bloated" but it needs to be in order to fulfill its purpose.
ZFS is "bloated" because it combines volumes and filesystems but breaking the Unix philosophy also enables a different kind of synergy and simplicity elsewhere.
by zenoprax
7/5/2026 at 3:06:14 PM
"bloat" just means "any feature I am not personally using therefore I deem as useless and pointless".by Blackthorn
7/5/2026 at 11:31:14 AM
A universal document converter is expected to expand via adding support for additional formats---that's okay (same for your other examples). I'm much more worried about the widening scope of the project.by lifthrasiir
7/5/2026 at 1:35:13 PM
A universal document converter knows what document it is working with and what to do with it once it has it. 'What a document is' is an AST that has resulted from a few thousand years of literate civilization. You can detect the outline of this AST - or AAST as you might call it - by asking what must be preserved in a different printing of the same, or in a translation.A universal document converter is 'expected' to admit transformations on the AST of a document. Luafilters do this more or less directly; operations via json representation do it in another.
I never used luafilters before, not knowing lua, but these days use them all the time for simple problems and am getting a clearer picture of the possibilities. This is because claude and codex write luafilters at the drop of a hat.
One simple illustration I have found of use with academic writing published inter alia in html arises from the willful decision of the html bureaucracy never to include a footnote syntax - and thus fall short of ABCs of any document concept however narrow and curtailed - because having said 'o we don't need footnotes, we have hypertext' back in clintontime they are too proud to change. In fact of course html is the format par excellence of footnotes ... as a gander at wikipedia will tell you. Pandoc can't parse them out of html - including its own html - since there is nothing to parse: the reader recognizes them by inspection in the browser. But you can ask claude to write a lua filter e.g. recognizing pandoc's own html footnotes - which are as arbitrary as everyone else's - and generate the structure intended by the author, in which they are footnotes.
by applicative
7/5/2026 at 12:57:03 PM
[dead]by redsocksfan45
7/5/2026 at 11:31:37 AM
We use it for seven years and it still runs fine when we update Pandoc - we usually always update things. I don’t remember anything about the docs, so not sure what changed.by a1o
7/5/2026 at 1:34:10 PM
I might be worried if it wasn't pandoc. It's always been bulletproof for me.by fwip