Research
Our work builds on our members' prior research on large language models for code and reinforcement learning, reflecting deep expertise in both academia and industry.
Code LLM
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
2020
Code LLM
Seed-Coder: Let the Code Model Curate Data for Itself
2025
Publications
2025
FullStack Bench: Evaluating LLMs as Full Stack Coders
Code LLM benchmark
Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving
Code LLM benchmark
Seed1.5-Thinking: Advancing Superb Reasoning Models with Reinforcement Learning
LLM
Seed-Coder: Let the Code Model Curate Data for Itself
Code LLM
Is PRM Necessary? Problem-Solving RL Implicitly Induces PRM Capability in LLMs
LLM
LLM-Powered Test Case Generation for Detecting Tricky Bugs
Code LLM
Focused-DPO: Enhancing Code Generation Through Focused Preference Optimization on Error-Prone Points
Code Post-training
CodeDPO: Aligning Code Models with Self Generated and Verified Source Code
Code Post-training
SEAlign: Alignment Training for Software Engineering Agent
Code Agent
2024
Deep Learning for Code Generation: A Survey
Code LLM
Poison Attack and Poison Detection on Deep Source Code Processing Models
Code LLM
Selene: Pioneering Automated Proof in Software Verification
Code LLM
TrickyBugs: A Dataset of Corner-case Bugs in Plausible Programs
Code LLM benchmark
HITS: High-coverage LLM-based Unit Test Generation via Method Slicing
Code LLM
CodeAgent: Enhancing Code Generation with Tool-Integrated Agent Systems for Real-World Repo-Level Coding Challenges
Code Agent
HiRoPE: Length Extrapolation for Code Models Using Hierarchical Position
Code LLM
2023
GPT-Fathom: Benchmarking Large Language Models to Decipher the Evolutionary Path towards GPT-4 and Beyond
LLM benchmark
Retrieval-Generation Synergy Augmented Large Language Models
LLM
Improved Visual Story Generation with Adaptive Context Modeling
LLM
Who Judges the Judge: An Empirical Study on Online Judge Tests
Code LLM
Self-Edit: Fault-Aware Code Editor for Code Generation
Code Agent
2022
ToolCoder: Teach Code Generation Models to Use API Search Tools
Code Agent
Towards Robustness of Deep Program Processing Models—Detection, Estimation, and Enhancement
Code LLM
Pretraining without Wordpieces: Learning Over a Vocabulary of Millions of Words
Foundation Model
One Model, Multiple Modalities: A Sparsely Activated Approach for Text, Sound, Image, Video and Code
Foundation Model
2021
2020
Hierarchical Poset Decoding for Compositional Generalization in Language
LLM
Fact-aware Sentence Split and Rephrase with Permutation Invariant Training
LLM
Generating Adversarial Examples for Holding Robustness of Source Code Processing Models
Code LLM
CodeBERT: A Pre-Trained Model for Programming and Natural Languages
Code LLM
Learning to Represent Programs with Heterogeneous Graphs
Code LLM