Coder Social home page Coder Social logo

liondw / signal-alignment Goto Github PK

View Code? Open in Web Editor NEW
17.0 2.0 0.0 26.37 MB

An initiative to create concise and widely shareable educational resources, infographics, and animated explainers on the latest contributions to the community AI alignment effort. Boosting the signal and moving the community towards finding and building solutions.

ai ai-alignment design education

signal-alignment's People

Contributors

liondw avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

signal-alignment's Issues

[Signal-Alignment] Articulating Risks of AI project

Part of the purpose of the signal alignment project is to help everyone play catch up on the wider alignment issue.

This is just the initial outline here - requires heavy compression

Simple AI generated presentations
https://gamma.app/public/4n03bbzmrgq8nrv
https://gamma.app/public/cxwkcy3ulil26f6

Articulating the Risks of AI project

(summary of the first half of the AGI unleashed video)
https://www.youtube.com/watch?v=fKgPg_j9eF0

  • Existential Risks
  • Social and Economic Turmoil
  • What are Autonomous AI?
  • Pace and scale of AI development feedback loop
    (advancement in one area is advancement in all) (leading to AGI)
  • The Control Problem (What is AI alignment)
  • Incomplete Solutions
  • Arguments for/ against AI alignment (an updated bad alignment take bingo could be turned into a shareable long scroll post for sharing on social media)

Also incorporate visualizations from the A.I. Dilemma talk ( This source has stronger citations, could be a better bridge to understanding for more official avenues)
https://www.youtube.com/watch?v=xoVJKj8lcNQ

  • Pace and scale of AI development feedback loop
    (advancement in one area is advancement in all) (leading to AGI)
  • Threats and Risks of powerful AI models
  • What is a GLLMM AI
  • 2024 will be the last human election
  • GLLMM have new emergent properties with increased scale
    (New languages, pace of theory of mind, arithmetic and maths, research level chemistry)
  • RLHF, and other alignment methods/ incomplete solutions ( a comment on the Swisse cheese approach?)
  • AI making stronger AI
  • more notes
  • [ ]

Image

[Heuristic Imperatives] Definition cards

Overview

  • Gather research and refine definitions based on sources
  • Create visual concepts and assets for later use
  • Create rough storyboards containing layouts and on-screen text
  • Create finalized storyboards with full assets
  • Publish and gather feedback

Heuristic Imperatives Cards:

Part 1: Definition

  • Title Card
  • What are the Heuristic Imperatives
  • Reduce Suffering
  • Increase Prosperity
  • Increase Understanding
  • Define Heuristic + examples
  • Define Imperatives + examples

Final Product in the repo
https://github.com/liondw/Signal-Alignment/blob/da86b1de16aae91bf9579a50720694c4db903936/Heuristic%20Imperatives%20Graphics/Heuristic%20Imperatives%20-%20Introduction.pdf

Give feedback into the discussions section, or to [email protected]

[Heuristic Imperatives] Animated Explainer

Overview

  • Gather research and use the definition cards to build the script outline
  • Generate and refine a short 1-2 minute script based on outline
  • Record a voice over for the script (using real voices or a text to speech model)
  • Animate a rough version using previously created definition cards
  • Publish and gather feedback

Further tasks

  • Implement feedback
  • Refine animation

DRAFT VIDEO
https://user-images.githubusercontent.com/130222960/231064311-ce8b08fb-7476-46bd-bd27-188cdfb686fd.mp4

Script outline [DRAFT]

Heuristic imperatives provide a framework for embedding ethical principles within autonomous AI systems. These principles serve as intrinsic motivations and a moral compass, guiding decision-making, learning, self-evaluation, and cognitive control.

The Three core heuristic imperatives for AI systems are: Reduce suffering in the universe, Increase prosperity in the universe, Increase understanding in the universe

By incorporating these principles into AI systems, we aim to create machines that are adaptable, context-sensitive, and capable of navigating the complexities and nuances of human values, beliefs, and experiences while maintaining ethical boundaries.

Traditional approaches to AI ethics can be inflexible and fail to account for the diverse and evolving nature of human values and contexts. Heuristic imperatives address this limitation and provide a more ethically responsible approach to AI.

Heuristic imperatives have the potential to address the control problem of AI, foster trust, and promote individual autonomy. They can be applied across various domains and are crucial for creating ethical AI systems that serve humanity's best interests.

[Signal-Alignment] Implementation cards

Through monitoring discussion online, it looks like there is a demand for more thorough examination of the application of the heuristic imperatives. On a personal note, I believe these principles are primarily an effort to provide a necessary foundation, but not a complete solution.
There are are the barriers that we need to overcome (collaboration, regulation, education/ awareness, active monitoring)

We Need a Plan project

(summary of the AGI Moloch video)
https://www.youtube.com/watch?v=8phmHSUDzRg

  • Ideal Attractor states
  • Define: Nash Equilibrium or Game Theory
  • Define: The Moloch theory or Race dynamics
  • Success Criteria (insert from timestamp 20:10)

I'm guessing this goes into recap territory?

Title card draft

Image

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.