3/30/2025 at 1:45:41 AM
I prefer to use clickhouse-local for all my CSV needs as I don't need to learn a new language (or cli flags) and can just leverage SQL. clickhouse local --file medias.csv --query "SELECT edito, count() AS count from table group by all order by count FORMAT PrettyCompact"
┌─edito──────┬─count─┐
│ agence │ 1 │
│ agrégateur │ 10 │
│ plateforme │ 14 │
│ individu │ 30 │
│ media │ 423 │
└────────────┴───────┘
With clickhouse-local, I can do lot more as I can leverage full power of clickhouse.
by pradeepchhetri
3/30/2025 at 5:22:25 PM
I used to use q for this sort of thing. Not sure if there are better choices now as it have been a few years.by SoftTalker
3/30/2025 at 4:43:26 PM
How does it compare with duckdb, which I usualy resort to? What I like with duckdb is that it's a single binary, no server needed, and it's been happy so far with all the CSV file I've thrown at it.by rixed
3/31/2025 at 2:27:17 AM
clickhouse-local is similar to duckdb, you don't need a clickhouse-server running in order to use clickhouse-local. You just need to download the clickhouse binary and start using it. clickhouse local
ClickHouse local version 25.4.1.1143 (official build).
:)
There are few benefits of using clickhouse-local since ClickHouse can just do lot more than DuckDB. One such example is handling compressed files. ClickHouse can handle compressed files with formats ranging from zstd, lz4, snappy, gz, xz, bz2, zip, tar, 7zip. clickhouse local --query "SELECT count() FROM file('top-1m-2018-01-10.csv.zip :: *.csv')"
1000000
Also clickhouse-local is much more efficient in handling big csv files[0]
by pradeepchhetri
4/2/2025 at 6:29:13 AM
Wanted to try it.Debian package is of poor quality: not even sure if clickhouse local is included in there, I believe so but there is no manpage, no doc at all, and no `clickhouse-server -h`.
Went to the official page looking for a tarball to download, found only the `curl|sh` joke.
Went to github looking for tagged tarballs, couldn't find any. Looked for INSTALL.md, couldn't find any.
Will try harder later, have to weep my tears for now.
by rixed
4/9/2025 at 4:14:49 PM
ClickHouse is a single binary. It can be invoked as clickhouse-server, clickhouse-client, and clickhouse-local. The help is available as `clickhouse-local --help`. clickhouse-local also has a shorthand alias, `ch`.This binary is packaged inside .deb, .rpm, and .tgz, and it is also available for direct download. The curl|sh script selects the platform (x86_64, aarch64 x Linux, Mac, FreeBSD) and downloads the appropriate binary.
by zX41ZdbW
4/9/2025 at 3:06:45 AM
> Debian package is of poor qualityCan you elaborate more please? I would love if you can say what all can be improved to make debian package up to standards.
by pradeepchhetri
4/19/2025 at 2:41:20 PM
Thank you for your interest.My comment was really about the state of documentation ("there is no manpage, no doc at all, and no `clickhouse-server -h`"). More specifically:
% dpkg -S clickhouse-server | grep bin
clickhouse-server: /usr/sbin/clickhouse-server
% man clickhouse-server
No manual entry for clickhouse-server
% man clickhouse
No manual entry for clickhouse
% /usr/sbin/clickhouse-server --help
Unknown option specified: help
% /usr/sbin/clickhouse-server -h
Unknown option specified: h
% ls -l /usr/share/doc/clickhouse-server
total 60
-rw-r--r-- 1 root root 235 Dec 5 2022 changelog.Debian.amd64.gz
-rw-r--r-- 1 root root 1437 Dec 5 2022 changelog.Debian.gz
-rw-r--r-- 1 root root 33174 Dec 20 2018 changelog.gz
-rw-r--r-- 1 root root 15057 Oct 29 2022 copyright
by rixed
3/30/2025 at 2:18:21 AM
I use SQLite in a similar manner, but I'll have to check this out.by sitkack