Learning from human preferences

علوم

OpenAI Blog

2017/06/13 - 07:00 516 مشاهدة

تحليل ذكي | AI Editorial Analysis

•One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to...

•In collaboration with DeepMind’s safety team, we’ve developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better.

هذا الخبر من OpenAI Blog. خبر يقدم أدوات ذكاء اصطناعي للتلخيص والترجمة والاستماع.

One step towards building safe AI systems is to remove the need for humans to write goal functions, since using a simple proxy for a complex goal, or getting the complex goal a bit wrong, can lead to undesirable and even dangerous behavior. In collaboration with DeepMind’s safety team, we’ve developed an algorithm which can infer what humans want by being told which of two proposed behaviors is better.

المصدر: OpenAI Blog | Source: OpenAI Blog

ملاحظة تحريرية | Editorial Note: نُشر هذا المقال في الأصل بواسطة OpenAI Blog. خبر (Khabr) هي منصة إعلامية أردنية مرخّصة تعمل بالذكاء الاصطناعي. نضيف قيمة تحريرية من خلال: تحليل ذكي للأخبار، ملخصات تلقائية، رواية صوتية بالذكاء الاصطناعي، ترجمة متعددة اللغات، وتدقيق الحقائق. هدفنا جعل الأخبار أكثر وضوحاً وسهولةً للقارئ العربي.

This article was originally published by OpenAI Blog. Khabr is a licensed Jordanian AI-powered news platform (Registration #82086). We add editorial value through: AI-powered news analysis, automated summaries, AI audio narration, multi-language translation (Arabic, English, French, Turkish), and AI fact-checking. Our mission is to make news more accessible and understandable for Arabic-speaking audiences worldwide.

قراءة المقال الأصلي

المزيد عن علوم | More on Science

هذا الخبر ضمن تغطية خبر لقسم علوم. نقدّم لك تحليلات ذكية وملخصات يومية لأهم الأخبار من مصادر موثوقة متعددة. المصدر: OpenAI Blog. يوجد 6 مقالات مرتبطة بهذا الموضوع.

This article is part of Khabr's coverage of Science. We provide AI-powered analysis, summaries, and multi-source aggregation to keep you informed. Source: OpenAI Blog. Tags: experience replay, reinforcement learning, AI.

Learning from human preferences

المزيد عن علوم | More on Science

مقالات ذات صلة

بحث نانوي متكامل لمكافحة تعقد الجذور بمحصول الطماطم يفوز بجائزة الإبتكار الزراعي

زلزال بقوة 6.05 درجة يضرب جزيرة مينداناو جنوب الفلبين

Astronomers Discover New Stars: A Surprising Breakthrough in Stellar Research

العين أكثر ذكاء مما نعتقد!.. اكتشاف شبكة خفية تعزز الرؤية الليلية

US, Russian astronauts launch into orbit for joint space mission

Canicule : Trois réacteurs nucléaires encore à l’arrêt en France en raison de la chaleur