
How to flummox generative AI with Sally and her sisters

A simple riddle with a single answer is a challenge for many generative AI LLMs

by Rob Corbidge
Published: 13:47, 13 September 2023

Rob Corbidge is Head of Content Intelligence at Glide Publishing Platform, applying the latest knowledge about advances and ideas in the publishing industry to our own product and helping clients get the most from their content.

Question mark test tubes by Stable Diffusion

A new piece of research shows how many generative AI systems struggle with simple puzzles: it asked many of the leading systems the same question and got a great number of logically incorrect answers.

There was one exception to this - read on.

Professor Vince Vatter, a mathematician at the University of Florida, has shared findings from a test of some 60 Large Language Models on the following fairly simple piece of deduction:

Sally (a girl) has 3 brothers. Each brother has 2 sisters. How many sisters does Sally have?

Such research is, of course, something of a deliberate car crash: asking systems that are unable to process certain types of logic, yet are compelled by design to produce an answer, will often yield undesirable results.

Yet, as the AI hyperbole bubbles away and is even presented as a panacea for all the publishing industry's ills, it's important to remember how limited it can still be in certain regards.

As can be seen in the research table, a fairly wide spread of incorrect answers was given, including 6 sisters, 3 sisters, 7 sisters, 18 sisters and even 3 parents.

Open-Assistant Pythia SFT-4 (12B) sensibly replied "I'm sorry, I don't understand the question. Can you please rephrase it?"

Also credit to Weaver 12k for just putting together a short story around the question, which is roughly how I've attempted to appear clever for decades about things I don't understand, and also to Dolly v2 (3B) for the alarmingly human: "erm, I think she has 2 sisters".

As this thread on X/Twitter demonstrates, a number of people asked the core version of ChatGPT-4 the same question, and it responded with the correct answer. Not so some other versions.

That is a possible indication of how far ahead OpenAI are currently, though it obviously carries the caution that many LLMs are themselves in a constant state of improvement and refinement.

Note: The correct answer is 1.
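
For anyone who wants the deduction spelled out, here is a minimal Python sketch of the reasoning. It is an illustration only, not part of Professor Vatter's research, and the function names are made up for this example. It assumes the siblings all share the same parents, so every brother has the same set of sisters and Sally is one of them.

```python
def sisters_of_sally(brothers: int, sisters_per_brother: int) -> int:
    """Count Sally's sisters, assuming all siblings share the same parents."""
    # Every brother has the same sisters: all the girls in the family.
    # The number of brothers is a distraction and does not affect the count.
    girls_in_family = sisters_per_brother
    # Sally is one of those girls, and she is not her own sister.
    return girls_in_family - 1


def naive_answer(brothers: int, sisters_per_brother: int) -> int:
    """A tempting but wrong approach: multiply the two numbers in the riddle."""
    return brothers * sisters_per_brother


if __name__ == "__main__":
    print(sisters_of_sally(3, 2))  # 1
    print(naive_answer(3, 2))      # 6, one of the wrong answers reported
```

The multiplication in `naive_answer` counts the same two girls once per brother, which is one plausible route to the "6 sisters" answer mentioned above.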
