Reliable Reasoning Beyond Natural Language
To address this, we propose a neurosymbolic approach that prompts LLMs to extract and encode all relevant information from a problem statement as logical code statements, and then use a logic programming language (Prolog) to conduct the iterative computations of explicit deductive reasoning.
[2407.11373] Reliable Reasoning Beyond Natural Language
A couple of years ago there were a number of papers that used the same approach for planning: they basically translated some natural language instructions to PDDL (the Planning Domain Definition Language, used by all mainstream planners) or, alternatively, Python, then they passed the result to a planner or to a robot’s API.
For example, see the following preprint:
Despite the claims in that paper, subsequent work showed severe problems with the approach. See the following for a review of planning using LLMs:
Since the paper you cite follows the same approach, except that it translates reasoning problems to definite programs rather than PDDL and hands them off to a Prolog interpreter rather than a planner, I anticipate the same failure modes as with the earlier work, which btw looked like it worked until some experts on planning had a look and pointed out the pressure points that cause the whole effort to collapse.
The problem in general is that LLMs cannot be relied upon to do the translation to a formal language accurately, unless they’ve already seen an accurate translation of what they are asked to translate. Translating natural language to a formal language itself requires decision-making that implies understanding of both languages and of the domain of discourse, and such understanding is absent from LLMs in novel domains. In other words, the proposed approach might yield a decent Prolog boilerplate generator, but it will be brittle and easy to break with simple techniques (like changing the names of symbols, as in the obfuscated blocksworld domain used to demonstrate the brittleness of LLMs-as-planners).
Yes, I’ve been experimenting with this approach – use the LLM (GPT-4 and Sonnet 3.5 tested) to map English to Prolog, query, then map back from Prolog to English. It takes a bit of badgering to get the LLM to generate complete, runnable code rather than merely sketch a possible Prolog representation, but the method does work.
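To make that concrete, here is a minimal illustration (my own toy example, not output from either model) of the kind of complete, runnable Prolog one wants the LLM to emit, together with the query the host program would run before the binding is verbalised back into English:

```prolog
% English input: "Alice is Bob's parent. Bob is Carol's parent.
% A grandparent is a parent of a parent. Who is Carol's grandparent?"

parent(alice, bob).
parent(bob, carol).

grandparent(GP, GC) :-
    parent(GP, P),
    parent(P, GC).

% Query run by the host program; the answer binding is then handed back
% to the LLM to be phrased in English:
% ?- grandparent(Who, carol).
% Who = alice.
```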
Tangentially, we’re finding that the LLMs are quite good at transpiling, even across programming language paradigms. So one can prototype in Prolog, then – if engineering objects to running Prolog in production – transpile to some other programming language, e.g. TypeScript.
A little bit has changed since August 2024, when I made the post about long inferencing. There is now some terminology in place, like CoT, DTG and RAG:
Plus, China has not only DeepSeek; there are many more, like Yi-Lightning, Alibaba Qwen, etc. Only DeepSeek is currently making the most waves, so that even my neighbour has heard of it.
I asked some random code generator AIs to create code for the Tower of Hanoi problem, but with a variable number of towers. Without further prompt engineering, all of them built a variant with n towers, but the code only used 3 of them and implemented only the standard algorithm. With prompt engineering they use the other towers, but the solutions are wrong. At least ChatGPT can explain why current code generator AIs are not able to solve that problem, when specifically asked.
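For reference, the standard 3-peg recursion that the generators keep falling back on looks roughly like this in Prolog (a sketch of the textbook algorithm, not any of the generated code); using the extra pegs efficiently requires a genuinely different strategy, which is where the generated solutions go wrong:

```prolog
% Standard 3-peg Tower of Hanoi: move N disks from From to To, using Via
% as the spare peg, producing the list of moves.
hanoi(0, _From, _To, _Via, []) :- !.
hanoi(N, From, To, Via, Moves) :-
    N > 0,
    M is N - 1,
    hanoi(M, From, Via, To, Before),
    hanoi(M, Via, To, From, After),
    append(Before, [move(From, To)|After], Moves).

% ?- hanoi(3, left, right, middle, Moves).
% Moves = [move(left,right), move(left,middle), move(right,middle),
%          move(left,right), move(middle,left), move(middle,right),
%          move(left,right)].
```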
Of course it can be that specialized AIs are able to solve planning problems, maybe comparable to AlphaGo or AlphaCode from DeepMind. But I think the real solution comes from solvers. What AIs maybe should learn is how to program solvers. But that also means that some NP-hard problems will, in the end, still be NP-hard.
Yes, because we only know how to program solvers for NP-complete problems, and AIs (i.e. LLMs nowadays) only know how to code what we know how to code.
The day an automated system discovers a polynomial-time decision algorithm for an NP-complete problem is the day when I, at least, will accept that we finally have something that deserves to be called artificial intelligence. But that day is not even on the horizon, and whatever system manages to do that, it’s not going to be a language model, large or small.
There are many interesting problems that don’t involve classes such as NP, but rather complexity classes that involve oracles:
Oracle machine
In complexity theory and computability theory, an oracle machine is an abstract machine used to study decision problems. It can be visualized as a Turing machine with a black box, called an oracle, which is able to solve certain problems in a single operation.
https://en.wikipedia.org/wiki/Oracle_machine
Typical oracles are the end-user or the internet. On the other hand, from the viewpoint of a Prolog system, a conversational agent could be the oracle. Can the whole be more than the sum of its parts?
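A minimal sketch of that idea, assuming nothing about any particular LLM API: any goal the local knowledge base cannot decide is deferred to an external oracle, here stubbed by simply asking the end-user at the terminal (one would replace the read/1 call with a call to the conversational agent):

```prolog
:- dynamic known/2.

% oracle_call(+Question): succeed if the oracle affirms Question.
% Answers are cached so each question is asked at most once.
oracle_call(Question) :-
    known(Question, Answer), !,
    Answer == yes.
oracle_call(Question) :-
    format("Oracle, is it true that ~w? (yes/no) ", [Question]),
    read(Answer),
    assertz(known(Question, Answer)),
    Answer == yes.

% Example rule that defers one condition to the oracle (hypothetical):
% good_gift(X) :- toy(X), oracle_call(likes(child, X)).
```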
Edit 29.01.2025
I find it quite fascinating that there are now the twin brothers ChatGPT and DeepSeek. I had posted a ChatGPT and DeepSeek interaction that somehow showed that they are not copies of each other. But I took it down, because I had a typo in it and wrote DeekSeek instead of DeepSeek. But for benchmarking conversational agents, one could use a setup where a Prolog system takes the output of ChatGPT and feeds it into DeepSeek, and vice versa, and performs some measurement, resulting in a kind of simultaneous exhibition.
Maybe one can fuzz-test the safety, etc. of a conversational agent that way?
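A rough sketch of such a setup, with ask_chatgpt/2 and ask_deepseek/2 as hypothetical placeholders for whatever HTTP client one wires up to the two APIs; the Prolog host just shuttles the replies back and forth and keeps the transcript for later measurement:

```prolog
% exhibition(+Prompt, +Rounds, -Transcript): ping-pong between the two
% agents for a fixed number of rounds, collecting the exchanged replies.
exhibition(_Prompt, 0, []) :- !.
exhibition(Prompt, Rounds, [chatgpt(A)-deepseek(B)|Rest]) :-
    Rounds > 0,
    ask_chatgpt(Prompt, A),   % hypothetical: prompt -> ChatGPT reply
    ask_deepseek(A, B),       % hypothetical: reply -> DeepSeek reply
    R is Rounds - 1,
    exhibition(B, R, Rest).
```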