أوبن إيه آي كودكس على شات جي بي تي: ثورة في تطوير البرمجيات باستخدام وكلاء الذكاء الاصطناعي

تستعد أداة Codex من OpenAI، المتاحة الآن عبر ChatGPT، لإحداث تحول في هندسة البرمجيات من خلال أتمتة المهام وتسريع التطوير وتحسين جودة التعليمات البرمجية. يمثل هذا الوكيل البرمجي السحابي قفزة كبيرة إلى الأمام تتجاوز الإكمال التلقائي البسيط، مما يمكّن المطورين من التركيز على النوايا عالية المستوى بينما يتولى الذكاء الاصطناعي التفاصيل المعقدة. يقدم هذا المنشور نظرة عامة شاملة عن Codex وقدراته وكيف يغير مشهد تطوير البرمجيات.

من الإكمال التلقائي إلى الترميز المستقل

إن تطور Codex من أداة إكمال تلقائي من الجيل التالي (مثل GitHub Copilot) إلى وكيل “ترميز” مستقل أمر رائع. يسمح هذا التحول للمطورين بالتعبير عن النتائج المرجوة بلغة طبيعية، مع تحديد Codex للتنفيذ الأمثل.

تم تحسين Codex-1، النموذج الذي يشغل هؤلاء الوكلاء، من خلال طلبات السحب الحقيقية، مع إعطاء الأولوية للتدقيق وأدلة الأسلوب واجتياز الاختبارات.
تمكن بنية الاستدلال الكامنة وراء Codex-1 من فهم وتنفيذ مهام الترميز المعقدة.

الوصول إلى Codex في ChatGPT

الوصول إلى Codex من خلال ChatGPT واضح ومباشر، مما يوفر تكاملاً سلسًا في مهام سير العمل الحالية.

الوصول: يتوفر رمز “Codex (تجريبي)” مخصص في الشريط الجانبي لـ ChatGPT.
تكامل GitHub: تتيح نقرة OAuth واحدة توصيل Codex بمستودعات GitHub، مما يسمح بقراءة/كتابة الوصول إلى المؤسسات أو المشاريع الشخصية المحددة.
تحديد المستودع: يمكن للمطورين اختيار مستودع وفرع معينين لـ Codex للعمل عليهما.
تكوين البيئة: يمكن تكوين متغيرات البيئة الاختيارية والأسرار وأوامر الإعداد، على غرار مهمة CI. يتم تثبيت المدققين والمشكلين مسبقًا ولكن يمكن تخصيصها.
قوالب المهام: تتوفر قوالب مهام محددة مسبقًا مثل “شرح البنية” و “البحث عن الاختبار الضعيف وإصلاحه” و “اقتراح مهام الصيانة”. يمكن أيضًا تقديم تعليمات مخصصة.
التنفيذ والمهام المتعددة: يمكن تشغيل مهام متعددة بالتوازي، حيث يتم تشغيل كل منها في جهاز افتراضي صغير خاص بها.
مراجعة النتائج: يشار إلى اجتياز الاختبارات بعلامات اختيار خضراء. يمكن الوصول إلى الاختلافات التفصيلية والشروح وسجلات العمل لكل مهمة.
التكرار: يمكن إنشاء طلبات السحب مباشرة، أو يمكن تقديم تعليمات متابعة لمزيد من التحسين.

إظهار قدرات Codex

توضح عدة أمثلة قدرة Codex على تسريع التطوير وتحسين مهام سير العمل للمراجعة وإصلاح مشكلات التعليمات البرمجية الطفيفة.

تسريع التطوير: يمكن لـ Codex إعداد سقالات المشروع بسرعة، مثل حزم Swift، مما يسمح للمطورين بالتركيز على بناء الميزات.
مراجعة مهام سير العمل: يبسط Codex مراجعة التعليمات البرمجية عن طريق إنشاء طلبات سحب يمكن للمطورين التحقق منها وتحسينها.
إصلاح المشكلات البسيطة: يعالج Codex الأخطاء الصغيرة ومشكلات جودة التعليمات البرمجية، مما يسمح للمطورين بتفويض هذه المهام والحفاظ على التركيز.
البحث عن الأخطاء أثناء العمل: يساعد Codex في مهام العمل العاجلة عن طريق تشخيص المشكلات بسرعة من آثار المكدس.

Codex مقابل o3: مقارنة

تسلط مقارنة بين Codex و o3 (من المفترض أنه نموذج ذكاء اصطناعي آخر) الضوء على دقة واكتمال Codex الفائقين في إصلاح الأخطاء. في مثال محدد يتضمن matplotlib، قام Codex بتحديد وتصحيح خطأ في تطبيع النافذة بشكل صحيح، بينما كان إخراج o3 غير مكتمل.

تضمن إصلاح Codex إزالة التعليمات البرمجية غير الضرورية وإضافة اختبار وحدة للتحقق من الصحة.

كيف يعمل Codex

تؤكد منهجية تدريب Codex على إكمال المهام بدلاً من احتمالية الرمز المميز.

يكتب الوكيل التعليمات البرمجية ويشغلها ويراقب نتائج الاختبار ويكافأ على الإكمال الناجح.
يدفع هذا التدريب الشامل Codex نحو سلوكيات مماثلة لمطور مبتدئ: إنشاء نصوص إعادة إنتاج وقراءة أخطاء التدقيق وإعادة محاولة الإصلاحات والالتزام بأدلة الأسلوب.

بيئة ورشة العمل السحابية

يعمل Codex داخل بيئة افتراضية مصغرة لكل مهمة، مما يوفر العزل والسلامة وقابلية التكاثر.

تتضمن البيئة الافتراضية المصغرة نظام الملفات الخاص بها ووحدة المعالجة المركزية وذاكرة الوصول العشوائي وسياسة الشبكة المقفلة.
يتم استنساخ المستودعات وحقن متغيرات البيئة وتثبيت أدوات المطور الشائعة مسبقًا.
تتيح هذه البيئة التوازي على نطاق واسع، مما يسمح بتشغيل مهام متعددة في وقت واحد.

يمكن لملف AGENTS.md اختياري تقديم تعليمات إلى Codex، بما في ذلك تخطيط المشروع وإجراءات الاختبار وتفضيلات نمط الالتزام والتفاصيل المفيدة الأخرى.

التوفر والخطط المستقبلية

يتوفر Codex حاليًا لمستخدمي ChatGPT Pro و Enterprise و Team، مع توقع وصول المستخدمين المجانيين والتعليميين قريبًا.

تشمل الخطط المستقبلية واجهة برمجة تطبيقات وتكامل في مسارات CI وتوحيد بين إصدارات CLI و ChatGPT.

خاتمة

يمثل Codex خطوة مهمة نحو مستقبل يتضمن فيه الترميز تنسيقًا عالي المستوى، مع معالجة الذكاء الاصطناعي للتنفيذ التفصيلي. من خلال السماح للمطورين بالتركيز على النية والتحقق من الصحة، يعد Codex بإحداث ثورة في مهام سير عمل تطوير البرمجيات وتعزيز تعاون أكبر بين الإنسان والوكيل.

المصدر: N/A

OpenAI’s Codex, now accessible through ChatGPT, is poised to transform software engineering by automating tasks, accelerating development, and enhancing code quality. This cloud-based software agent represents a significant leap beyond simple autocomplete, enabling developers to focus on high-level intent while the AI handles the intricate details. This post provides a comprehensive overview of Codex, its capabilities, and how it’s changing the software development landscape.

From Autocomplete to Autonomous Vibe Coding

Codex’s evolution from a next-generation autocomplete tool (like GitHub Copilot) to an autonomous “vibe coding” agent is remarkable. This shift allows developers to communicate their desired outcomes in natural language, with Codex determining the optimal implementation.

Codex-1, the model powering these agents, is fine-tuned on real pull requests, prioritizing linting, style guides, and passing tests.
The reasoning architecture behind Codex-1 enables it to understand and execute complex coding tasks.

Accessing Codex in ChatGPT

Accessing Codex through ChatGPT is straightforward, providing a seamless integration into existing workflows.

Access: A dedicated “Codex (beta)” icon is available in the ChatGPT sidebar.
GitHub Integration: A single OAuth click connects Codex to GitHub repositories, allowing read/write access to specified organizations or personal projects.
Repository Selection: Developers can choose a specific repository and branch for Codex to work on.
Environment Configuration: Optional environment variables, secrets, and setup commands can be configured, similar to a CI job. Linters and formatters are pre-installed but can be customized.
Task Templates: Pre-defined task templates like “Explain the architecture,” “Find and fix the flakey test,” and “Suggest maintenance chores” are available. Custom instructions can also be provided.
Execution and Multitasking: Multiple tasks can be launched in parallel, each running in its own micro-VM.
Reviewing Results: Passing tests are indicated with green checkmarks. Detailed diffs, explanations, and work logs are accessible for each task.
Iteration: Pull requests can be created directly, or follow-up instructions can be given for further refinement.

Demonstrating Codex’s Capabilities

Several examples showcase Codex’s ability to accelerate development, improve review workflows, and fix minor code issues.

Accelerating Development: Codex can quickly set up project scaffolding, such as Swift packages, allowing developers to focus on building features.
Reviewing Workflows: Codex streamlines code review by generating pull requests that developers can validate and refine.
Fixing Papercuts: Codex handles small bugs and code quality issues, allowing developers to delegate these tasks and maintain focus.
Finding Errors On-Call: Codex assists with urgent on-call tasks by quickly diagnosing issues from stack traces.

Codex vs. o3: A Comparison

A comparison between Codex and o3 (presumably another AI model) highlights Codex’s superior accuracy and completeness in bug fixing. In a specific example involving matplotlib, Codex correctly identified and fixed an error in window normalization, while o3’s output was incomplete.

Codex’s fix involved removing unnecessary code and adding a unit test for validation.

How Codex Works

Codex’s training methodology emphasizes task completion over token-level plausibility.

The agent writes code, runs it, observes test outcomes, and is rewarded for successful completion.
This end-to-end training pushes Codex towards behaviors similar to a junior developer: creating repro scripts, reading lint errors, retrying fixes, and adhering to style guides.

Cloud Workshop Environment

Codex operates within a micro-VM sandbox for each task, providing isolation, safety, and reproducibility.

The sandbox includes its own file system, CPU, RAM, and locked-down network policy.
Repositories are cloned, environment variables are injected, and common developer tools are pre-installed.
This environment enables parallelism at scale, allowing multiple tasks to be run simultaneously.

An optional AGENTS.md file can provide instructions to Codex, including project layout, test procedures, commit style preferences, and other helpful details.

Availability and Future Plans

Codex is currently available to ChatGPT Pro, Enterprise, and Team users, with access expected for free-tier and EDU users soon.

Future plans include an API, integration into CI pipelines, and unification between the CLI and ChatGPT versions.

Conclusion

Codex marks a significant step towards a future where coding involves high-level orchestration, with AI handling the detailed implementation. By allowing developers to focus on intent and validation, Codex promises to revolutionize software development workflows and foster greater human-agent collaboration.

Source: N/A

القائمة