5/19/2025 at 5:45:56 PM
It still seems to have the problems most other LLMs suffer with except Gemini: it loses context so quickly.I asked it about a paper I was looking at (SLOG [0]) and it basically lost the context of what "slog" referred to after 3 prompts.
1. I asked for an example transaction illustrating the key advantages of the SLOG approach. It responded with some general DB transaction stuff.
2. I then said "no use slog like we were talking about" and then it gave me a golang example using the log/slog package
Even without the weird political things around Grok, it just isn't that good.
by scuol
5/20/2025 at 4:45:28 AM
When I use the "think" mode it retains context for longer. I tested with 5k lines of c compiler code and I could 6 prompts in before it started forgetting or generalizingI'll say that grok is really excellent at helping my understand the codebase, but some miss-named functions or variables will trip it up..
by convivialdingo
5/21/2025 at 5:01:14 AM
not from a tech field at all but would it do the context window any good to use "think" mode but discard them once the llm gives the final answer/reply?is that even possible to disregard genrated token's selectively?
by pomtato
5/20/2025 at 7:13:59 AM
it also doesn't help that many of these companies tend to either limit the context of the chat to the 10 most recent messages (5 back and forths), or rewrite the history summarized in a few sentences. Both ways lose a ton of information, but you can avoid that behaviour by going through the APIs. Especially Azure OpenAI et... on the web is useless, but it's quite capable through custom APsI think Gemini is just the only one that by default keeps the entire history verbatim.
by dahcryn
5/21/2025 at 4:26:47 PM
for me xAI has its place mainly for 1) exclusive access to tweets and 2) being uncensored. and it's decent enough (even if it's not the best) in terms other capabilitiesby aibrother
5/21/2025 at 7:29:09 PM
> being uncensoredWith the recent article on how it was easily manipulated, I wouldn't be so confident it is uncensored, just that its bias is leaning into its owner's beliefs; which isn't great.
Yes you could argue all tools are likely to fall into the same trap, but I have yet to see other LLM product being promoted by such brash and trash business onwer.
by touristtam
5/20/2025 at 10:01:29 AM
The paid version "SuperGrok" has a larger context window, but nothing beats Gemini for that.I tried your question with SuperGrok. Here's the result.
https://grok.com/share/bGVnYWN5_d298dd12-9942-411c-900c-2994...
I use Grok for similar tasks and usually prefer Grok's explanations. Easier to understand.
For some problems where I've asked Grok to use formal logical reasoning I have seen Grok outperform both Gemini 2.5 Pro and ChatGPT-o3. It is well trained on logic.
I've seen Grok generate more detailed and accurate descriptions of images that I uploaded. Grok is natively multimodal.
There is no single LLM that outperforms all of the others at all tasks. I've seen all of the frontier models strongly outperform each other at specific tasks. If I was forced to use only one, that would be Gemini 2.5 Pro (for now) because it can process a million tokens and generate much longer output than the others.
by voidspark
5/20/2025 at 3:06:12 AM
[flagged]by Gigachad
5/20/2025 at 7:54:32 AM
Be careful saying things like that or you'll get [flagged] - discussion of what seemed an incredibly important subject is forbidden on here it seems.by srmarm
5/20/2025 at 12:00:39 PM
Hilarious that you correctly predicted this being flagged. Forbidden topic on HN it seems.by Gigachad
5/20/2025 at 9:11:14 AM
[flagged]by rsynnott
5/20/2025 at 9:59:36 AM
Effective Altruism is still great, and never stopped being great. Guilt does not transfer by association in this way.by tormeh
5/20/2025 at 10:40:22 AM
There's definitely _something_ there, but, as with all philosophies, the internet has taken it and run with it to a fairly absurd degree, to the point where, for many adherents, it's basically a religion.by rsynnott
5/20/2025 at 10:06:58 AM
It's not. Feeding kids, researching vaccines and a bunch of other things that billionaires are funding should not depend on the graces and whims of billionaires, it should be something provided for by the government.by mschuster91
5/20/2025 at 10:06:05 AM
HN crowd is ... mixed, it's perhaps the one last true melting pot we have on the Internet. A curse and a blessing, if you ask me.You got truly anything here. Europeans that in general tend to lean more towards "democratic socialism" and its various offshoots, American libertarians (which have a large intersection with Musk fanboys), a bunch of extremely rich startup founders, American progressives, conservatives of all kinds, Zionists and Hamas apologists, probably Russian and Chinese psy-ops, accelerationists, preppers... name any ideology and you'll find supporters on HN.
What has changed a bit is that tribalism seems to have taken over from civilized or at least arguments and fact oriented discourse. Personally, I'd prefer if downvotes and especially flags would require one to give a reason so that repeat offenders that just flag and downvote everything they disagree with can get suspended for ruining discussion.
by mschuster91
5/20/2025 at 10:39:24 AM
Interesting how you put "hamas apologists" and not pro-palestinians next to Zionists. How would you have felt if it was written "pro-palestinains and genocide-apologists"?by Snow_Falls
5/20/2025 at 10:56:19 AM
[flagged]by mschuster91
5/20/2025 at 2:59:07 PM
If you want to meet pro-palestinians that don't have cartoonishly stereotyped opinions I suggest meatspace and not online.by skyyler
5/20/2025 at 1:35:06 PM
Do you know what a "bubble" is? In fact, do you actually know any pro-palestinain people or do you get media that tells you about them? These are not the same thing. Very neat that you included "from the river to the sea" as right alongside rape. Very telling.PS you can find street interviews of random isreali's where they will straight up tell you they wish all palestinians were killed with very little prompting. But I guess they just don't count huh?
by Snow_Falls
5/20/2025 at 11:44:47 AM
> I have yet to meet any "pro-palestinian" that doesn't devolve into "rape is resistance"> In contrast, all Zionists I know utterly despise Netanyahu and his far-right government.
Oh dear
As it happens, I know plenty of people who don't think the people in Gaza should be genocided and none of them support rape.
Many of the self-labelled Zionists I know support Bibi and think Gaza should be razed to the ground.
Go figure!
by monkey_monkey
5/20/2025 at 8:17:37 AM
You never know when it will start spouting it either. That kind of uncertainty in the responses landing in your interface is just not sustainable. Your money is coming from the quality of the content your system is putting out. If it's being used for dentistry, and it randomly spits out white supremacist content, dentists will look for a system that won't do that. Because they asked about, say, intaglio surfaces for a wearable dental appliance. Not a treatise on white genocide.At this point, to use Grok, you'd be intentionally setting your startup to detonate itself at some random point in the future. That's just not how you make money.
by bilbo0s
5/20/2025 at 6:57:58 AM
So.. If the 'source' of data is 9gag, 4chan, you will get 'this' material. If you feed it Tumlr, you will get Harry Potter and rope-porn-thingies. If you feed it Hitler's speeches, you will get 'that' material. If you feed it algebra, you will get 'that' material.Then.. Do we want 'open' or 'curated' LLMs? And how far from reality are the curated LLMs? And how far can curated LLMs take us (black Nazis? female US founding fathers?).
Pick your poison I say.. and be careful what you wish for. There is no "perfect" LLM because there is no "perfect" dataset, and Sam-Altman-types-of-humans are definitely deeply flawed. But life is flawed, so our tools are/will be flawed.
by HenryBemis
5/20/2025 at 7:19:44 AM
The problem was not the source of the training data. xAI confirmed that the system prompt had been modified to make grok talk about South African white genocide.While they didn’t say who modified it. It’s hard to believe it wasn’t Elon.
by Gigachad
5/20/2025 at 8:51:02 AM
> While they didn’t say who modified it. It’s hard to believe it wasn’t Elon.Is it really that hard to understand how these things happen?
The boss says "remove bias" but the peons don't really know how to do that and the naive approach to unbiasing a thing is to introduce bias in the other direction. And then if you're Google and the boss thinks it has a right-wing bias you crook it and get black Nazis and if you're xAI and the boss thinks it has a left-wing bias you get white genocide.
In both cases the actual problem is when people think bias operates like an arithmetic sum, because it doesn't.
by AnthonyMouse
5/20/2025 at 9:01:04 AM
Except someone clearly wanted Grok to talk about some very specific South African phrases and events, not just “remove bias” in the general sense.by archagon
5/20/2025 at 9:28:24 AM
That's precisely how the arithmetic theory of bias operates. That bias doesn't actually work that way is why applying it causes such ridiculous outcomes.by AnthonyMouse
5/20/2025 at 10:42:14 AM
[dead]by computerthings
5/21/2025 at 8:35:59 AM
The term "kill the boer" was almost certainly added to the system prompt because Grok would begin talking about specifically that song, unprompted, to millions of people no matter what they were talking about.This is not a case of trying to remove bias. I don't for a second believe anyone from the demographic using this site acting naive about that either, just have whatever political opinion and don't pretend this is respectable.
by chownie
5/20/2025 at 7:38:55 AM
Eye rollby jeffhuys
5/20/2025 at 7:52:30 AM
It's hard to believe it is Elon either, I don't think he would know how unless they made a special interface for himby drkleiner
5/20/2025 at 8:43:33 AM
He is the CEO of a company, he can personally ask someone to do it for him.by WhyIsItAlwaysHN
5/20/2025 at 7:33:56 PM
But implementation was too messy for someone with expertise to do it on CEO's requestby drkleiner