Navigation X
ALERT
Click here to register with a few steps and explore all our cool stuff we have to offer!



   346

AI is officially SELF-IMPROVING -- now what?

by Frostflood - 31 December, 2024 - 09:09 PM
This post is by a banned member (Frostflood) - Unhide
165
Posts
24
Threads
#1
"We show that STaR (Self-Taught Reasoner) significantly improves
performance on  multiple datasets compared to a model fine-tuned
to directly predict final answers, and performs comparably to fine-tuning
  a 30× larger state-of-the-art language model on CommensenseQA.

Thus, STaR lets a model improve itself by learning from its own generated reasoning."


Well, it's finally here. Self-improving and self-teaching AI has hit a major inflection point. We saw glimpses of things like this with previous generative pre-trained models at the beginning of the attention transformer explosion. But we've never seen it to this degree and especially at this scale. A big critique of AI for a long time has been it lacks common sense or real-world understanding. It appears this is the solution. As the paper explains it is utilizing what are basically "in-between steps" before outputting the final answer -- much like human reasoning. This is also how neural-networks operate in general, leveraging a high number of "hidden neuron layers" between input and output. It seems we've discovered something pretty major here and it's obvious with the new improvements in recently developed models like OpenAI's o3.  

[Image: keiLVYC.png]

While it had some ability to reason before, we've stumbled onto something new that is not only capable of exponentially more powerful and accurate reasoning it also is capable of logarithmically improving using self-generated "synthetic data."

"In this paper, we adopt a different approach: by leveraging the LLM’s pre-existing reasoning ability,
we iteratively bootstrap the ability to generate high-quality rationales. Specifically, we few-shot
prompt a large language model to self-generate rationales and refine the model’s ability further by
fine-tuning on those rationales that lead to correct answers. We repeat this procedure, using the
improved model to generate the next training set each time. This is a synergistic process, where
improvements in rationale generation improve the training data, and improvements in training data
further improve rationale generation."


So where do we go from here? Is this the point when AI becomes a superintelligence? Is Terminator coming in 2025? Are we going to cure cancer and solve world hunger?
What do you think?

Either way, seems we're in for a wild ride.

[Image: hackerman2.gif]
This post is by a banned member (Eminem) - Unhide
Eminem  
Trial Moderator
2.998
Posts
568
Threads
Staff Team
4 Years of service
#2
Skynet is real  monkayes
[Image: scQLGRB.gif]
23/09
[Image: nR3sNdA.gif]
20/09

[Image: A0gE4KJ.gif]
[Image: gHprB92.gif]
This post is by a banned member (Frostflood) - Unhide
165
Posts
24
Threads
#3
(31 December, 2024 - 09:16 PM)Eminem Wrote: Show More
Skynet is real  [Image: monkayes.gif]

You think we'll see something like that in our lifetime?
This post is by a banned member (unauthorised) - Unhide
397
Posts
58
Threads
#4
sigma
This post is by a banned member (Frostflood) - Unhide
165
Posts
24
Threads
#5
(01 January, 2025 - 01:54 AM)unauthorised Wrote: Show More
sigma

ligma
This post is by a banned member (Frostflood) - Unhide
165
Posts
24
Threads
Bumped #6
This is a bump

Create an account or sign in to comment
You need to be a member in order to leave a comment
Create an account
Sign up for a new account in our community. It's easy!
or
Sign in
Already have an account? Sign in here.


Forum Jump:


Users browsing this thread: 1 Guest(s)