Search Results - Programming Exercise Generation Benchmark

1

Loading…

A Survey Study on the State of the Art of Programming Exercise Generation Using Large Language Models by Frankford, Eduard, Hohn, Ingo, Sauerwein, Clemens, Breu, Ruth

ISSN: 2377-570X

Published: IEEE 29.07.2024

Published in Proceedings / Conference on Software Engineering Education & Training (29.07.2024)
“…This paper analyzes Large Language Models (LLMs) with regard to their programming exercise generation capabilities…”

Get full text

Conference Proceeding

Save to List

Saved in:
2

Loading…

JavaBench: A Benchmark of Object-Oriented Code Generation for Evaluating Large Language Models by Cao, Jialun, Chen, Zhiyong, Wu, Jiarong, Cheung, Shing-Chi, Xu, Chang

ISSN: 2643-1572

Published: ACM 27.10.2024

Published in IEEE/ACM International Conference on Automated Software Engineering : [proceedings] (27.10.2024)
“…Code generation benchmarks such as HumanEval are widely adopted to evaluate LLMs' capabilities…”

Get full text

Conference Proceeding

Save to List

Saved in:
3

Loading…

On the performance of large language models on introductory programming assignments by Raihan, Nishat, Goswami, Dhiman, Puspo, Sadiya Sayara Chowdhury, Siddiq, Mohammed Latif, Newman, Christian, Ranasinghe, Tharindu, Santos, Joanna C. S., Zampieri, Marcos

ISSN: 0925-9902, 1573-7675

Published: 16.08.2025

Published in Journal of intelligent information systems (16.08.2025)
“…) have led to the development of a new generation of Large Language Models (LLMs) trained on massive amounts of data…”

Get full text

Journal Article

Save to List

Saved in:
4

Loading…

Dataset of Program Source Codes Solving Unique Programming Exercises Generated by Digital Teaching Assistant by Demidova, Liliya A., Andrianova, Elena G., Sovietov, Peter N., Gorchakov, Artyom V.

ISSN: 2306-5729, 2306-5729

Published: Basel MDPI AG 01.06.2023

Published in Data (Basel) (01.06.2023)
“…This paper presents a dataset containing automatically collected source codes solving unique programming exercises of different types…”

Get full text

Journal Article

Save to List

Saved in:
5

Loading…

JavaBench: A Benchmark of Object-Oriented Code Generation for Evaluating Large Language Models by Cao, Jialun, Chen, Zhiyong, Wu, Jiarong, Shing-chi Cheung, Chang, Xu

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 11.10.2024

Published in arXiv.org (11.10.2024)
“…Code generation benchmarks such as HumanEval are widely adopted to evaluate LLMs' capabilities…”

Get full text

Paper

Save to List

Saved in:
6

Loading…

Evaluating the quality of scenarios of short-term wind power generation by Pinson, P., Girard, R.

ISSN: 0306-2619, 1872-9118

Published: Elsevier Ltd 01.08.2012

Published in Applied energy (01.08.2012)
“… ► Guidelines for future evaluation/benchmark exercises. Scenarios of short-term wind power generation are becoming increasingly popular as input to multistage decision-making problems e.g…”

Get full text

Journal Article

Save to List

Saved in:
7

Loading…

DiffCoder: Enhancing Large Language Model on API Invocation via Analogical Code Exercises by Zan, Daoguang, Yu, Ailun, Shen, Bo, Chen, Bei, Li, Wei, Gong, Yongshun, Chen, Xiaolin, Yao, Yafen, Luo, Weihua, Guan, Bei, Liu, Yan, Wang, Yongji, Wang, Qianxiang, Cui, Lizhen

ISSN: 2994-970X, 2994-970X

Published: New York, NY, USA ACM 12.07.2024

Published in Proceedings of the ACM on software engineering (12.07.2024)
“…The task of code generation aims to generate code solutions based on given programming problems…”

Get full text

Journal Article

Save to List

Saved in:
8

Loading…

SEDGE: Symbolic example data generation for dataflow programs by Kaituo Li, Reichenbach, Christoph, Smaragdakis, Yannis, Diao, Yanlei, Csallner, Christoph

Published: IEEE 01.11.2013

Published in 2013 IEEE/ACM 28th International Conference on Automated Software Engineering (ASE) (01.11.2013)
“… Past work demonstrated effective ways to generate small example data sets that exercise operators in the Pig platform, used to generate Hadoop map-reduce programs…”

Get full text

Conference Proceeding

Save to List

Saved in:
9

Loading…

CSEPrompts: A Benchmark of Introductory Computer Science Prompts by Nishat Raihan, Goswami, Dhiman, Sadiya Sayara Chowdhury Puspo, Newman, Christian, Ranasinghe, Tharindu, Zampieri, Marcos

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 04.04.2024

Published in arXiv.org (04.04.2024)
“…Recent advances in AI, machine learning, and NLP have led to the development of a new generation of Large Language Models (LLMs…”

Get full text

Paper

Save to List

Saved in:
10

Loading…

A Track-Based Conference Scheduling Problem by Riquelme, Fabian, Montero, Elizabeth, Pérez-Cáceres, Leslie, Rojas-Morales, Nicolás

ISSN: 2227-7390, 2227-7390

Published: Basel MDPI AG 01.11.2022

Published in Mathematics (Basel) (01.11.2022)
“…The scheduling of conferences is a challenging task that aims at creating successful conference programs that fulfill an often wide variety of requirements. In…”

Get full text

Journal Article

Save to List

Saved in:
11

Loading…

Single muscle fibre biomechanics and biomechatronics – The challenges, the pitfalls and the future by Friedrich, Oliver, Haug, Michael, Reischl, B, Prölß, G, Kiriaev, Leon, Head, Stewart I, Reid, Michael B

ISSN: 1357-2725, 1878-5875, 1878-5875

Published: Netherlands Elsevier Ltd 01.09.2019

Published in The international journal of biochemistry & cell biology (01.09.2019)
“… We review major standard systems available from research labs and commercial sources, and benchmark those to our recently developed automated MyoRobot biomechatronics platform that provides…”

Get full text

Journal Article

Save to List

Saved in:
12

Loading…

Coordinating self-sizing and self-repair managers for multi-tier systems by Gueye, Soguy Mak-Karé, De Palma, Noël, Rutten, Éric, Tchana, Alain, Berthier, Nicolas

ISSN: 0167-739X, 1872-7115

Published: Elsevier B.V 01.06.2014

Published in Future generation computer systems (01.06.2014)
“…Computing systems have become more and more distributed and heterogeneous, making their manual administration difficult and error-prone. The Autonomic…”

Get full text

Journal Article

Save to List

Saved in:
13

Loading…

Strategic Behaviour in a Capacity Market? The New Irish Electricity Market Design by Teirilä, Juha, Ritz, Robert A.

ISSN: 0195-6574, 1944-9089

Published: Los Angeles, CA International Association for Energy Economics 15.01.2019

Published in The Energy journal (Cambridge, Mass.) (15.01.2019)
“…The transition to a low-carbon power system requires growing the share of generation from (intermittent…”

Get full text

Journal Article

Save to List

Saved in:
14

Loading…

Evaluating and Aligning CodeLLMs on Human Preference by Yang, Jian, Yang, Jiaxi, Jin, Ke, Miao, Yibo, Zhang, Lei, Yang, Liqun, Cui, Zeyu, Zhang, Yichang, Binyuan Hui, Lin, Junyang

ISSN: 2331-8422

Published: Ithaca Cornell University Library, arXiv.org 06.12.2024

Published in arXiv.org (06.12.2024)
“… Most previous code-related benchmarks, which consist of various programming exercises along with the corresponding test cases, are used as a common measure to evaluate the performance and capabilities of code LLMs…”

Get full text

Paper

Save to List

Saved in:
15

Loading…

Swarm inspired test case generation for online C++ programming assessment by Oi-Mean Foong, Quang-Trung Tran, Suet-Peng Yong, Rais, Helmi Md

ISBN: 1479943916, 9781479943913

Published: IEEE 01.06.2014

Published in 2014 International Conference on Computer and Information Sciences (ICCOINS) (01.06.2014)
“… Moreover, they also need to define test cases for different programming exercises in order to assess students' code…”

Get full text

Conference Proceeding

Save to List

Saved in:
16

Loading…

Improving differential evolution through a unified approach by Padhye, Nikhil, Bhardawaj, Piyush, Deb, Kalyanmoy

ISSN: 0925-5001, 1573-2916

Published: Boston Springer US 01.04.2013

Published in Journal of global optimization (01.04.2013)
“…— Initialization, Selection, Generation and Replacement , which are sufficient to describe…”

Get full text

Journal Article

Save to List

Saved in:
17

Loading…

Generation of variational standard plant room solutions by Medjdoub, Benachir, Richens, Paul, Barnard, Nick

ISSN: 0926-5805

Published: Amsterdam Elsevier B.V 01.03.2003

Published in Automation in construction (01.03.2003)
“…We have used the object-based CAD programming to take advantage of standardisation to handle the selection, sizing, layout and (potentially…”

Get full text

Journal Article

Save to List

Saved in:
18

Loading…

MixFX-SCORE: Heterogeneous Fixed-Point Compilation of Dataflow Computations by Deheng Ye, Kapre, Nachiket

Published: IEEE 01.05.2014

Published in 2014 IEEE 22nd Annual International Symposium on Field-Programmable Custom Computing Machines (01.05.2014)
“… When designing circuits for reconfigurable hardware, we can exercise independent control over bitwidth selection of each variable in the computation…”

Get full text

Conference Proceeding

Save to List

Saved in:
19

Loading…

Test input reduction for result inspection to facilitate fault localization by Hao, Dan, Xie, Tao, Zhang, Lu, Wang, Xiaoyin, Sun, Jiasu, Mei, Hong

ISSN: 0928-8910, 1573-7535

Published: Boston Springer US 01.03.2010

Published in Automated software engineering (01.03.2010)
“…Testing-based fault-localization (TBFL) approaches often require the availability of high-statement-coverage test suites that sufficiently exercise the areas around the faults…”

Get full text

Journal Article

Save to List

Saved in:
20

Loading…

Health Club Industry Benchmarks Show Some Good and Some Bad by Scudder, Michael Scott

ISSN: 2150-2692, 2150-2706

Published: Washington Questex, LLC 03.08.2015

Published in Club Industry (03.08.2015)
“… Group exercise class attendance increases crested in early 2013 and-though still a healthy part of many clubs' programming-needs a shot of adrenaline and stronger promotion…”

Get full text

Trade Publication Article

Save to List

Saved in:

Search Results - Programming Exercise Generation Benchmark

A Survey Study on the State of the Art of Programming Exercise Generation Using Large Language Models by Frankford, Eduard, Hohn, Ingo, Sauerwein, Clemens, Breu, Ruth

JavaBench: A Benchmark of Object-Oriented Code Generation for Evaluating Large Language Models by Cao, Jialun, Chen, Zhiyong, Wu, Jiarong, Cheung, Shing-Chi, Xu, Chang

On the performance of large language models on introductory programming assignments by Raihan, Nishat, Goswami, Dhiman, Puspo, Sadiya Sayara Chowdhury, Siddiq, Mohammed Latif, Newman, Christian, Ranasinghe, Tharindu, Santos, Joanna C. S., Zampieri, Marcos

Dataset of Program Source Codes Solving Unique Programming Exercises Generated by Digital Teaching Assistant by Demidova, Liliya A., Andrianova, Elena G., Sovietov, Peter N., Gorchakov, Artyom V.

JavaBench: A Benchmark of Object-Oriented Code Generation for Evaluating Large Language Models by Cao, Jialun, Chen, Zhiyong, Wu, Jiarong, Shing-chi Cheung, Chang, Xu

Evaluating the quality of scenarios of short-term wind power generation by Pinson, P., Girard, R.

DiffCoder: Enhancing Large Language Model on API Invocation via Analogical Code Exercises by Zan, Daoguang, Yu, Ailun, Shen, Bo, Chen, Bei, Li, Wei, Gong, Yongshun, Chen, Xiaolin, Yao, Yafen, Luo, Weihua, Guan, Bei, Liu, Yan, Wang, Yongji, Wang, Qianxiang, Cui, Lizhen

SEDGE: Symbolic example data generation for dataflow programs by Kaituo Li, Reichenbach, Christoph, Smaragdakis, Yannis, Diao, Yanlei, Csallner, Christoph

CSEPrompts: A Benchmark of Introductory Computer Science Prompts by Nishat Raihan, Goswami, Dhiman, Sadiya Sayara Chowdhury Puspo, Newman, Christian, Ranasinghe, Tharindu, Zampieri, Marcos

A Track-Based Conference Scheduling Problem by Riquelme, Fabian, Montero, Elizabeth, Pérez-Cáceres, Leslie, Rojas-Morales, Nicolás

Single muscle fibre biomechanics and biomechatronics – The challenges, the pitfalls and the future by Friedrich, Oliver, Haug, Michael, Reischl, B, Prölß, G, Kiriaev, Leon, Head, Stewart I, Reid, Michael B

Coordinating self-sizing and self-repair managers for multi-tier systems by Gueye, Soguy Mak-Karé, De Palma, Noël, Rutten, Éric, Tchana, Alain, Berthier, Nicolas

Strategic Behaviour in a Capacity Market? The New Irish Electricity Market Design by Teirilä, Juha, Ritz, Robert A.

Evaluating and Aligning CodeLLMs on Human Preference by Yang, Jian, Yang, Jiaxi, Jin, Ke, Miao, Yibo, Zhang, Lei, Yang, Liqun, Cui, Zeyu, Zhang, Yichang, Binyuan Hui, Lin, Junyang

Swarm inspired test case generation for online C++ programming assessment by Oi-Mean Foong, Quang-Trung Tran, Suet-Peng Yong, Rais, Helmi Md

Improving differential evolution through a unified approach by Padhye, Nikhil, Bhardawaj, Piyush, Deb, Kalyanmoy

Generation of variational standard plant room solutions by Medjdoub, Benachir, Richens, Paul, Barnard, Nick

MixFX-SCORE: Heterogeneous Fixed-Point Compilation of Dataflow Computations by Deheng Ye, Kapre, Nachiket

Test input reduction for result inspection to facilitate fault localization by Hao, Dan, Xie, Tao, Zhang, Lu, Wang, Xiaoyin, Sun, Jiasu, Mei, Hong

Health Club Industry Benchmarks Show Some Good and Some Bad by Scudder, Michael Scott

Search Tools:

Refine Results

Format

Subject Area

Topic

Language

Year of Publication