4/30/2026 at 2:35:47 PM
Legal professional here. This is NOT a replacement for proper legal AI assistants (e.g. Westlaw, in my jurisdiction). As far as I can tell, this is just a wrapper around regular LLMs i.e. nothing that you couldn't achieve yourself with the right prompting.What legal professionals actually pay for, and that is virtually impossible to replicate unfortunately, is to give the AI access to a legal database of case law. Without case law, you can't do accurate legal research, and you are inviting disaster if you're doing things like drafting statements of case or skeleton arguments.
There's a reason why companies like Thomson Reuters have an oligopoly on these types of products, and can get away with charging thousands a year. They are the only ones with access to a comprehensive set of case law, and they've entrenched their position by having exclusive contracts with the law reporting companies. Without that, your model is just relying on publicly available cases that it can find on Google etc., and that's just a fraction of the full set.
With that said, these types of competitor products can be useful if you're just doing simple tasks like drafting letters or reviewing contracts and you accept that you need to do the legal research separately. But again, you can get that with just ChatGPT + a good prompt.
by jbstack
4/30/2026 at 2:52:51 PM
> There's a reason why companies like Thomson Reuters have an oligopoly on these types of products, and can get away with charging thousands a year. They are the only ones with access to a comprehensive set of case law, and they've entrenched their position by having exclusive contracts with the law reporting companies.I'm not in the legal field, but can someone explain that further? I would have expected that all case law is public access. Not necessarily easy access, but when a judge writes an opinion, why on Earth would that opinion be gated behind a corporation? What am I missing?
by hn_throwaway_99
4/30/2026 at 3:13:03 PM
Yes, it sounds crazy and against the principle of open justice, but unfortunately this is the reality. Certainly in the UK which is my jurisdiction - and I believe in the US too although I don't know for sure.In theory, any member of the public can obtain a judgment by applying for one at the court and paying a fee. That's fine if you just need a one-off judgment, don't mind paying the fee, and you're not in a hurry. It also assumes that you know which case you need.
For realistic legal research, you might need to wade through dozens of cases just to even know if any of them are relevant, you might have a deadline of tomorrow to get it done, and you might not want to pay that fee for a bunch of cases that you aren't going to end up needing. Only a company which already has a comprehensive copy of virtually every important case can help you here.
A typical workflow for a complex piece of legal research might look like this:
1. You need to research a legal topic.
2. Do some Googling, or chat to your LLM, to get a rough overview and some pointers for further research (but don't completely rely on what you find).
3. Read some professional content (e.g. Practical Law articles relevant to the topic, or a legal textbook).
4. Read the relevant legislation.
5. Use a legal database to download all the cases you found from steps 2 and 3 which seem like they might be relevant.
6. Use a legal database to download all the cases which cite the relevant legislative provisions you found in step 4 and seem like they might be relevant.
7. Use the legal database to confirm that those cases are still good law (not overridden or criticised by a later case).
8. Skim read them, discard those that turned out to obviously not be relevant.
9. Read the remaining ones more closely.
10. Note any useful-looking cases which are cited in the ones from step 9, and recursively work your way through those cases as well.
Relying on court-provided copies of judgments won't realistically help you with most of these steps.
by jbstack
4/30/2026 at 10:45:28 PM
Related question, then - what do judges use when they have to write opinions in the first place? Do they have to follow the same process and use Thomson Reuters?It's obviously even more important for judges (compared to lawyers) to be able to easily search all of the relevant case law to see which cases are controlling and would have precedence. Seems bizarre to me that this critical function would be gated behind a corporation.
by hn_throwaway_99
5/2/2026 at 10:51:06 AM
The other answer (staff who use Westlaw) is right, but critically this is the point of adversarial litigation. The justice system doesn't hang on that judge finding the precedent; it assumes that the highly motivated lawyers on either side will find the relevant precedent that helps their case and highlight it for the judge.by tylervigen
5/1/2026 at 9:35:35 AM
They have staff that do it for them :-)by gadders
5/2/2026 at 10:21:08 AM
And those staff mostly use Westlaw :)by ferngodfather
4/30/2026 at 7:27:13 PM
those steps are what a RAG LLM agent setup excels at.If tech companies invested 10% of what they have in AI assisted coding tools into AI assisted legal tools, they would be able to do those steps easily.
It is definitely coming.
by ericmcer
5/1/2026 at 5:28:48 PM
You can have as much RAG as you like, but if you're missing the data itself (the legal judgments), it's useless. The fundamental problem here isn't technical, it's that a very small number of corporations have complete control over the source material.by jbstack
5/1/2026 at 4:02:03 AM
This all needs to be publicly accessible for free. Gonna see how blatently inconsistent laws and interpretations areby whattheheckheck
5/2/2026 at 12:52:52 PM
To be fair, there's sufficient of it publicly available (e.g. on https://www.bailii.org/) that you can easily disprove the conspiracy theory that laws and interpretations are "blatently inconsistent". In fact most judgments are very well thought through and carefully written and cited, because those that aren't tend to be appealed if they are establishing an important principle and getting it wrong. There just isn't enough publicly available to base a full legal AI assistant on.by jbstack
4/30/2026 at 3:11:09 PM
Do not know about the US, but some countries publish some type of high profile cases but _only_ after anonimization for obvious privacy reasons.To access the DB through the modern archive (well modern as new rules) you'd have to be an accredited professional passing through a few legal hardles and digital chancellor's office for each copy. It's like going to a bureaucracy^bureaucracy office.
Some early companies given their initial foothold were not required these checks so they were able to get hold of bigger archives (it's also important to remember often legislation or conformity is done through consulting or lobbying done by these entrenched players).
They can also build on the Data professionals themselves submit.
by eagleal
5/3/2026 at 7:41:55 PM
> Do not know about the US, but some countries publish some type of high profile cases but _only_ after anonimization for obvious privacy reasons.It doesn't work that way in the US. Legal judgements are documents of public record, and they are normally published in full - it's not uncommon to search for someone's name and see legal cases pop up that they have been involved in.
There are specific instances where a judge can seal a judicial record (and records for minors are sealed automatically), someone may petition for their own records to be expunged, and parties may ask for some information to be redacted, but these are normally (except in the case of minors) not done automatically. As I understand it, the US has much more lax rules around the publicity of legal proceedings than other jurisdictions. For example, even though someone is deemed "innocent until proven guilty", arrest records and mugshots are reported all the time in the media even though this would be illegal in many other areas of the world.
by hn_throwaway_99
4/30/2026 at 5:14:14 PM
This is great on this topic: https://www.thebignewsletter.com/p/gatekeepers-of-law-inside...by jonners00
5/1/2026 at 12:23:42 AM
In Canada there is a database of a lot of case law here: https://www.canlii.org/Theoretically speaking if someone scraped all of it and added it to something like this open source Mike project would that then be a much better tool for lawyers?
by r0fl
5/2/2026 at 12:44:28 PM
> would that then be a much better toolBetter than before, yes. Good for general legal work that doesn't require robust legal research, yes. Sufficient for full legal research, no.
The problem is that "a lot of case law" isn't enough case law. You need close to everything. Otherwise this can happen: Canlii case X -> Legal principle Y. Westlaw case Z not on Canlii -> X overriden, Y no longer good law. Or you might simply not find a case which cogently supports your argument, when one does in fact exist. Or, conversely, you are unaware of a detrimental case which your opponent knows about because they have Westlaw.
by jbstack
5/2/2026 at 10:22:03 AM
We have bailii.org but it doesn't have everything.by ferngodfather
4/30/2026 at 3:17:07 PM
> Legal professional here. This is NOT a replacement for proper legal AI assistants (e.g. Westlaw, in my jurisdiction). As far as I can tell, this is just a wrapper around regular LLMs i.e. nothing that you couldn't achieve yourself with the right prompting.I'd not use generative AI for anything but a cursory check anyway⁰. Even if it is trained on clean up-to-date data rather than all the wrong information that is out there, it could still give a wrong answer and I have no leg to stand on if I rely upon it. At least if I pay a human and they trust the LLM too much, I'll hopefully have some call to pursue them for giving bad advice when it bites me.
--------
[0] Or at all… But even if I wasn't someone actively avoiding LLMs, the point would still stand
by dspillett
5/2/2026 at 12:45:19 AM
https://case.law/by daydream
4/30/2026 at 3:24:56 PM
What does legora, harvey, or crosby add here other than the default westlaw/TR/lexis integrations?by htrp
4/30/2026 at 10:39:58 PM
I'd imagine it's like using Cursor/Claude Code vs. a Jetbrains IDE plugin.by david_shi
5/2/2026 at 8:23:49 PM
To be clear, the value of services like Legora and Harvey was not as a replacement for legal-specific services like Westlaw. LLMs are absolutely abysmal at legal research and analysis, and every week we are seeing reports of some lawyer being called out publicly for submitting a brief to a court with hallucinated case citations. This has happened to lawyers at all levels, including lawyers at big law firms. If you ask an LLM questions about very basic principles of law, it will do a decent job, but anything requiring nuance or specialization, they will spit out things that sound plausible, but are not correct.And this is not likely to change in the future, as the legal market is so small and niche that the leading makers of LLMs have put legal analysis near the bottom of their list of priorities in terms of improving model performance. There is very little if any effort by the major LLM companies to curate sources of additional high-quality legal training data or fine-tune their models to improve performance on legal tasks. Law is also a field with very low tolerance for error, where tiny mistakes can have big consequences, and getting the models to perform well under these constraints would require a lot of investment without a sufficient payoff.
The true reason big firms are buying Harvey and Legora subscriptions is simply to use an LLM for LLM-type tasks, like document review, spotting issues in user-provided documents, and other things that LLMs do well. True, services like Harvey and Legora have lots of cool templates and features for legal work, but you will find that most of the people who use these services in these firms use them much the same way they'd use ChatGPT, Claude, or any other AI chatbot.
The reason law firms can't just use ChatGPT or Claude is that they can't allow confidential or privileged client data (such as documents provided through the prompts) to be stored and hosted on a third party service like ChatGPT or Claude, as these companies may have to turn over client data in response to subpoenas, and depending on the type of LLM account you have, these companies could use your user prompts to train future models thus risking leakage of client data to third parties and potential privilege waiver.
Services like Harvey and Legora solve this problem by accessing the LLMs through APIs, and all client data, prompts/responses, etc., are stored encrypted on Harvey or Legora servers and protected by keys held by the customer. For many law firms, this is 95% of what they're paying for.
The big challenge "Mike" presents to services like Harvey and Legora is that it exposes how little additional value they offer over ChatGPT or Claude, for the vast majority of law firms. A system like "Mike" can provide the same security benefit at basically $0 cost, and can be hosted on the law firm's own internal servers. This is going to put a lot of pricing pressure on services like Harvey and Legora; law firms are notoriously cheap when it comes to IT and software spend and will switch quickly if cheaper alternatives arise. This confirms that Harvey and Legora are going to have to sell their services based on the value they add to lawyer productivity, and not just on being a protected wrapper around GPT or Claude.
by VoidCoefficient