bigcode-project / bigcode-evaluation-harness Public

Fork 219
Star 827

Pull requests: bigcode-project/bigcode-evaluation-harness

28 Open 112 Closed

Pull requests list

Fix typos

#286 opened Nov 6, 2024 by gameofby

add support for hpu devices

#281 opened Oct 25, 2024 by envsp

"," missing in LANGUAGES list

#280 opened Oct 21, 2024 by ArtemisDicoTiar

Speedup execute.py: Reuse same manager and dict in

#277 opened Oct 1, 2024 by michaelfeil

Basecodes

#263 opened Aug 14, 2024 by Abhineetsoccer

Add a new benchmark ENAMEL for evaluating the efficiency of LLM-generated code

#260 opened Jul 22, 2024 by q-rz

Fix Max New Tokens in HF's Generation Config

#257 opened Jul 18, 2024 by mostafaelhoushi

Fix unnecessary repeated overwrite

#249 opened Jun 29, 2024 by nielstron

fix: Multiple-E dataset fix go_test.go path for test execution

#225 opened Apr 20, 2024 by hitesh-1997

Add llama3 instruction prompts

#222 opened Apr 19, 2024 by TechxGenus

Leaderboard README improvements

#217 opened Apr 14, 2024 by nikita1503

remove pad tokens added by the accelerator.pad_across_processes

#216 opened Apr 13, 2024 by IQ17

Ensure generations get saved in generation_only mode

#212 opened Mar 31, 2024 by Vipitis

fix apps evaluate error: local variable 'level' referenced before assignment

#206 opened Mar 10, 2024 by koking0

Update README.md

#204 opened Mar 2, 2024 by AnitaLiu98

Fix loading PAL-GSM few-shot examples

#196 opened Feb 8, 2024 by sxjscience

Make main.py compatible with OpenAI compatible APIs

#189 opened Jan 23, 2024 by hmellor

Fix typo in README.md

#177 opened Jan 2, 2024 by ab-10

[WIP] Shadereval tasks

#173 opened Dec 16, 2023 by Vipitis • Draft

1 of 4 tasks

Add support for Ollama, Palm, Claude-2, Cohere, Replicate, Llama2 CodeLlama (100+LLMs) [LiteLLM]

#160 opened Nov 9, 2023 by ishaan-jaff

Dockerfile-multiple no longer fetches pip dependencies needlessly

#157 opened Nov 1, 2023 by RemcoSchrijver

Adding additional optional args for decoding flags and AutoModel kwargs to support models like ReplitLM

#115 opened Jul 12, 2023 by madhavatreplit

Support Seq2SeqLM model class (to facilitate the CodeT5+ models)

#104 opened Jun 26, 2023 by keyboardAnt

Attempt to make MultiPl-E's evaluation parallelization over all completions at once rather than just over each problem.

#86 opened Jun 7, 2023 by esslushy

Add: learning performance-improving code edits 🥧

#65 opened Apr 23, 2023 by SwayamInSync
