
Connecting companies with top-tier remote developers
G2i is a specialized talent marketplace headquartered in Delray Beach, Florida, connecting companies with vetted software developers skilled in web, mobile, and cross-platform technologies, particularly React, React Native, and Node.js. With a focus on remote work, G2i serves clients ranging from st...
G2i offers competitive compensation, flexible remote work options, and a supportive environment for engineers to thrive. Employees enjoy a generous PT...
G2i fosters a remote-first culture that emphasizes trust and autonomy, allowing engineers to work from anywhere while focusing on delivering high-qual...
G2i Inc. is hiring a Software Engineer, AI to help train large language models to write and evaluate code. You'll work primarily in Java, reviewing and improving AI-generated code. This role requires 3+ years of software engineering experience.
You have 3+ years of professional software engineering experience in Java, with a command of the language strong enough to write and evaluate production-grade code. You have sharp code-review instincts and can quickly spot logic errors, performance traps, and security issues. Close attention to detail and excellent written communication are essential, because much of this role involves explaining why one approach is better than another. You enjoy reading documentation and language specs, and you thrive in an asynchronous, low-oversight environment. Constraint programming experience is a bonus but not required. You are eager to learn and adapt, especially in the context of AI and reinforcement learning.
In this role, you will help train large language models (LLMs) to write production-grade code across a wide range of programming languages. You will compare and rank multiple code snippets, explaining which is best and why. You will repair and refactor AI-generated code for correctness, efficiency, and style. You will feed ratings, edits, and test results into the reinforcement learning from human feedback (RLHF) pipeline and help keep it running smoothly. The workflow is simple: the model generates code; expert engineers like you rank, edit, and justify it; and that feedback is converted into reward signals that tune the model toward production-ready code. The end result is a model that proposes, critiques, and improves code the way expert engineers would.
This position is fully remote, so you can work from anywhere. Compensation ranges from $30/hr to $70/hr, depending on your location and seniority. The minimum commitment is 15 hours per week, with the option to increase to as many as 40 hours per week. The role is structured as a 1099 contract and is focused on direct, hands-on impact. If this sounds like a fit, we encourage you to apply even if your experience doesn't match every requirement.