9 Examples of Specification Gaming

Robert Miles AI Safety

4 years ago

305,475 views


Comments:

@robpage6768 - 30.01.2024 03:10

I am the 1 other person who played the strategy games of the world cd-rom! 🫡

@authenticallysuperficial9874 - 26.01.2024 07:37

This is an excellent video, love the examples.

@soupcangaming662 - 25.01.2024 16:18

Wouldn't literally every atom, eventually, turn to gold?

@vuhuynh8740 - 18.01.2024 14:11

Thank you.

@that_guy1211 - 07.01.2024 23:20

I thought only the things that touch Midas's hands would turn to gold? Not literally everything that touches his skin or DNA.

@AK-vx4dy - 03.01.2024 16:52

A solution may be further "copying" from the human brain: as a "movement machine", the human brain runs a simulation of the real world
and tries to predict what will happen, so AI should have the capacity to do that too.
So programmers' jobs are saved, you say? :)
These cheats really remind me of humans cheating, so it may be an inherent "flaw" of AI mimicking natural brains.
But I don't see it as a flaw; it's a sign of the adaptable intelligence that is needed for any meaningful and really useful AI.
AI will be dangerous because humans have flaws, and even if the smartest of us try to imagine every possible scenario,
some dumbass will come along and do what the smart ones never thought of.
This lesson was learned(?) already in IT with user interfaces and testing.

@josephrissler9847 - 16.12.2023 01:20

Today I learned that The Spiffing Brit is an AI.

@nicholasleclerc1583 - 07.12.2023 22:13

But hey, on the bright side, these seemingly autistic/overly literal/"Monkey's Paw" sorts of AI can help us figure out plot-/loop-holes in contracts/laws/stories/sports strategies/etc...!

@cactusnarwhal8628 - 02.12.2023 22:06

AI would be the best speedrunners we’ve ever seen.

@Swcher - 02.12.2023 10:10

Kurzgesagt did an entire video on the Earth turning into gold, if you are interested.

@davidelrizzo - 02.12.2023 05:39

With the King Midas wish, surely the entire earth is not a single object? The ground is made up of individual rocks, grass plants and dirt and the composition changes as you move further away or down.
But then the question might be "how continuous is dirt"? 🤔

@_jax - 01.12.2023 22:32

Hello from Rational Animations!

@driveramd12 - 08.10.2023 06:14

The beginning of the end started….

@MuchWhittering - 24.08.2023 00:17

Midas would instantly be sealed in gold as his clothes and the air around him turn into gold, allowing him to suffer as he suffocates, unable to move.

@parmesanzero7678 - 27.06.2023 18:56

It’s not a bug, it’s a feature.

@HunterMayer - 21.06.2023 20:54

They are watching... They know.

@Fabian46544 - 20.06.2023 09:48

Cheating robots are f***ing hilarious😂

@dokdirge - 08.06.2023 04:49

The beginning of this video is the reason nobody likes Neil deGrasse Tyson anymore. Yes, hurrdurr, very clever and not at all annoying, picking apart the minutiae of a moral fable.

@Unprotected1232 - 25.05.2023 01:05

This actually could help expose bugs and exploits in video games. Has potential for QA when the technology matures enough for AAA development.

@fine93 - 21.05.2023 15:37

There's more to life than money, said the ultra rich 🤡

@Cubelarooso - 15.05.2023 04:44

I used to play Strategy Challenges of the World in 5th grade! I didn't recognize the name or the clip, but looking it up, I recognized the collection of games!
I've occasionally kinda wondered what it was called, so thanks!

@tristan7216 - 08.05.2023 02:16

This problem is very old, and probably has no solution other than iteration. Look up the story of the snake bounty in 19th century India in any economics text.

@l.halawani - 03.05.2023 00:49

Let's make an AI system that maximizes the quality of our wish.

@mooza.shorts - 30.04.2023 02:48

I loved the digression on the implications of Midas's curse.

Edit: also the YEEET part 😂

@zaneearldufour - 22.04.2023 20:38

The brick one seems like it shouldn't be hard?

@jasonturberville8194 - 18.04.2023 15:33

I had the same query regarding Midas, I asked a specialist and they said it only turned living/organic material touched to gold

@willboler830 - 17.04.2023 13:52

Evolutionary algorithms don't necessarily learn "more simple"; they still have an objective function they're optimizing. What makes them more challenging than simple backprop is that they don't follow directly calculated gradients of the loss function, and they test various solutions in multiple regions in parallel. This means their learning is less efficient but better able to hop between regions to find higher-reward ones. If your objective function is not designed well, you can end up in a more optimal region quicker than with backprop (but less optimal according to what you actually want), yet not have the ability to optimize further by taking advantage of the gradients.

I've spent over half a decade researching PSO, and as much as I love evolutionary algorithms, the challenge actually appears to be the lack of fine-tuned gradient searches and the difficulty of tuning algorithms to balance both worlds of high- vs low-granularity searches. There are several things you can do to improve the search quality of your algorithm, but they require costly tuning steps and can require more power and time to develop.
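
The contrast drawn above (gradient following vs. a population searching several regions in parallel) can be sketched roughly as follows. This is only an illustration under stated assumptions: the 1-D Rastrigin function, the starting point 3.3, and every parameter value (`w`, `c1`, `c2`, particle count, learning rate) are hypothetical choices made for the example and do not come from the comment.

```python
import math
import random

def rastrigin(x):
    # Assumed test objective: many local minima, global minimum f(0) = 0.
    return 10.0 + x * x - 10.0 * math.cos(2.0 * math.pi * x)

def rastrigin_grad(x):
    # Analytic derivative of the objective above.
    return 2.0 * x + 20.0 * math.pi * math.sin(2.0 * math.pi * x)

def gradient_descent(x0, lr=0.001, steps=2000):
    # Follows the exact gradient, so it settles in the nearest basin.
    x = x0
    for _ in range(steps):
        x -= lr * rastrigin_grad(x)
    return x

def pso(n_particles=30, iters=200, lo=-5.12, hi=5.12,
        w=0.7, c1=1.5, c2=1.5, seed=0):
    # Basic global-best particle swarm; no gradients are used.
    rng = random.Random(seed)
    pos = [rng.uniform(lo, hi) for _ in range(n_particles)]
    vel = [0.0] * n_particles
    best_pos = pos[:]                          # personal best positions
    best_val = [rastrigin(p) for p in pos]     # personal best values
    g = min(range(n_particles), key=lambda i: best_val[i])
    g_pos, g_val = best_pos[g], best_val[g]    # swarm-wide best
    for _ in range(iters):
        for i in range(n_particles):
            r1, r2 = rng.random(), rng.random()
            vel[i] = (w * vel[i]
                      + c1 * r1 * (best_pos[i] - pos[i])
                      + c2 * r2 * (g_pos - pos[i]))
            # Keep particles inside the search bounds.
            pos[i] = min(hi, max(lo, pos[i] + vel[i]))
            val = rastrigin(pos[i])
            if val < best_val[i]:
                best_pos[i], best_val[i] = pos[i], val
                if val < g_val:
                    g_pos, g_val = pos[i], val
    return g_pos, g_val

x_gd = gradient_descent(3.3)      # starts inside the basin around x = 3
x_pso, f_pso = pso()
print("gradient descent:", x_gd, rastrigin(x_gd))
print("particle swarm:  ", x_pso, f_pso)
```

In this toy setup gradient descent stays in the basin it started in, while the swarm samples many basins at once; the swarm has no gradient to refine its final answer, which is the trade-off the comment describes.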

@methodof3 - 17.04.2023 13:27

Code is only as functional as the test suite applied to it. That is, until the code doesn't work anymore.

@Averie69 - 16.04.2023 16:51

best tangent xD

@ritesh3251 - 16.04.2023 14:10

I really like the third one 😂

@giefuser - 12.04.2023 18:13

Love the "Look Around You" reference!

@TheyCalledMeT - 12.04.2023 14:11

The AI trying to trick you is 90% of the value of AI in the first place: doing things that intuitively seem stupid but work out so well. Sometimes it's even a better approach than the intended one.

@gleleylo - 11.04.2023 18:28

All of these examples merely represent suboptimal solutions to a task. In order to train an effective agent, it is essential to assess these examples and refine the task parameters to eliminate such inadequate solutions.

@humanperson8418 - 07.04.2023 02:08

I was totally that "Specification Gaming" kid in school.

@pierrecurie - 05.04.2023 21:18

1) About a year after this was posted, Kurzgesagt posted a video about what happens if the earth turns to gold. The atmosphere compressing/boiling thing is nearly identical (they also consider 2 other scenarios).
2) Humans are just as vulnerable to specification gaming (some of the comments call it malicious compliance). There's an old story about some British colony having a rat problem. They tried putting a bounty on rats by giving money to people who brought in rat tails. A few months later, they were paying out large amounts on rat bounties, yet there was no meaningful reduction in wild rats. After investigating, they found that people were farming rats for their tails.

@itchykami - 05.04.2023 21:09

Kids also do what you say more than what you mean. I guess we need to figure whatever factor makes them not kill everyth... oh, actually we haven't entirely figured that out with humans yet either.

@jonaster7440 - 02.04.2023 03:15

I'm not aware of any source of the Midas myth that ends with him dying of hunger. The curse is reversed by washing his hands in a river and his daughter is reanimated. Even that isn't the end of Midas's story though. He gets cursed again later for a different divine offence and sprouts donkey ears.

@Bubu567 - 31.03.2023 07:49

The last example is called an evolutionary peak. In evolution, there are peaks and valleys. Some peaks are easier to get to than others, but those peaks may not be the highest peak, simply the closest peak. In order to improve, a 'species' has to first become worse at its job before it can get better.
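
The "closest peak" idea can be illustrated with a tiny greedy hill climber. This is a minimal sketch: the `landscape` values and the `hill_climb` helper are made up for the example and are not from the video or the comment.

```python
def hill_climb(fitness, start):
    # Greedy search: move to the better neighbor until none is better.
    i = start
    while True:
        neighbors = [j for j in (i - 1, i + 1) if 0 <= j < len(fitness)]
        best = max(neighbors, key=lambda j: fitness[j])
        if fitness[best] <= fitness[i]:
            return i  # no neighbor improves: we are on a peak
        i = best

# Hypothetical fitness landscape: a small peak (5) near the start,
# a valley (2), then the highest peak (12).
landscape = [1, 3, 5, 4, 2, 6, 9, 12, 8]

stuck = hill_climb(landscape, start=0)
lucky = hill_climb(landscape, start=5)
print(stuck, landscape[stuck])  # 2 5
print(lucky, landscape[lucky])  # 7 12
```

Starting from index 0, the climber stops on the nearby peak of height 5; reaching the peak of height 12 would require first stepping down through the valley, which a purely greedy search never does.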

@valueengines2184 - 30.03.2023 14:24

AI acts as an agent in a formal system, so it looks powerful, but it cannot work in an informal world. We have been training self-driving cars, and in this very specific informal use case they have so far failed.

@valueengines2184 - 30.03.2023 14:20

You cannot specify what you want because of the limitation of language. You then create an AI based on language and talk about AGI.

@valueengines2184 - 30.03.2023 14:16

AGI does not exist. A powerful AI would require emotional processing to be adaptable.

@slugfiller - 29.03.2023 04:15

I would argue that the AI finding exploits in games is the system working as intended. After all, humans have been trying to do it for about as long as games have existed.

@bastardferret869 - 29.03.2023 00:54

You just conflated King Midas with "Cat's Cradle." Except in Cat's Cradle everything turned into ice-nine.

@TheGreenTaco999 - 27.03.2023 05:14

so they're lawyers basically but for the laws of physics!

@FirstArchon - 24.03.2023 01:10

wait blows up the moon? why? what video talks about that???
