Fine-tuning GPT-2 from human preferences

تكنولوجيا

OpenAI Blog

2019/09/19 - 07:00 518 مشاهدة

تحليل ذكي | AI Editorial Analysis

•We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences did not...

•Specifically, for summarization tasks the labelers preferred sentences copied wholesale from the input (we’d only asked them to ensure accuracy), so our models learned to copy.

•Summarization required 60k human labels; simpler tasks which continue text in various styles required only 5k.

هذا الخبر من OpenAI Blog. خبر يقدم أدوات ذكاء اصطناعي للتلخيص والترجمة والاستماع.

We’ve fine-tuned the 774M parameter GPT-2 language model using human feedback for various tasks, successfully matching the preferences of the external human labelers, though those preferences did not always match our own. Specifically, for summarization tasks the labelers preferred sentences copied wholesale from the input (we’d only asked them to ensure accuracy), so our models learned to copy. Summarization required 60k human labels; simpler tasks which continue text in various styles required only 5k. Our motivation is to move safety techniques closer to the general task of “machines talking to humans,” which we believe is key to extracting information about human values.

المصدر: OpenAI Blog | Source: OpenAI Blog

ملاحظة تحريرية | Editorial Note: نُشر هذا المقال في الأصل بواسطة OpenAI Blog. خبر (Khabr) هي منصة إعلامية أردنية مرخّصة تعمل بالذكاء الاصطناعي. نضيف قيمة تحريرية من خلال: تحليل ذكي للأخبار، ملخصات تلقائية، رواية صوتية بالذكاء الاصطناعي، ترجمة متعددة اللغات، وتدقيق الحقائق. هدفنا جعل الأخبار أكثر وضوحاً وسهولةً للقارئ العربي.

This article was originally published by OpenAI Blog. Khabr is a licensed Jordanian AI-powered news platform (Registration #82086). We add editorial value through: AI-powered news analysis, automated summaries, AI audio narration, multi-language translation (Arabic, English, French, Turkish), and AI fact-checking. Our mission is to make news more accessible and understandable for Arabic-speaking audiences worldwide.

قراءة المقال الأصلي

المزيد عن تكنولوجيا | More on Technology

هذا الخبر ضمن تغطية خبر لقسم تكنولوجيا. نقدّم لك تحليلات ذكية وملخصات يومية لأهم الأخبار من مصادر موثوقة متعددة. المصدر: OpenAI Blog. يوجد 6 مقالات مرتبطة بهذا الموضوع.

This article is part of Khabr's coverage of Technology. We provide AI-powered analysis, summaries, and multi-source aggregation to keep you informed. Source: OpenAI Blog. Tags: AI, GPT-2, machine learning.

Fine-tuning GPT-2 from human preferences

المزيد عن تكنولوجيا | More on Technology

مقالات ذات صلة

عطل مفاجئ يضرب منصة فيسبوك لدى بعض المستخدمين

المواجهة المثيرة: تسلا رودستر 2026 تتحدى ريماتش نيفيرا في سباق السرعة الكهربائية!

Electric Showdown: Tesla Roadster 2026 vs Rimac Nevera – Which Hypercar Reigns Supreme?

وزارة العدل تواصل التوسع الرقمى بافتتاح مكتب توثيق بنك مصر بالتجمع الخامس

Victoria announces new social media ‘demasking’ powers for accounts accused of vilification

ذكاء اصطناعي.. كيمي كا3 الصيني ينافس أوبن أيه.آي وأنثروبيك