Skip to content

Commit 02d3969

Browse files
committed
update
1 parent b073974 commit 02d3969

File tree

4 files changed

+27
-27
lines changed

4 files changed

+27
-27
lines changed

about.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
<header>
1111
<nav>
1212
[<a href="index.html">Home</a>]
13-
[<a href="#">Software Arena</a>]
13+
[<a href="#">SWE Arena</a>]
1414
</nav>
1515
</header>
1616

@@ -71,7 +71,7 @@ <h2>Sponsors</h2>
7171
Computer Intelligence Project is supported by donations from the following institutions: E2B, Hugging Face, IBM, and CSIRO's Data61.
7272
</p>
7373
<p>
74-
We also thank the following companies for providing API credits to serve their models on Software Arena: Alibaba, and IBM.
74+
We also thank the following companies for providing API credits to serve their models on SWE Arena: Alibaba, and IBM.
7575
</p>
7676
<p>
7777
We welcome diverse forms of donations and sponsorships, including but not limited to cash, computing devices (e.g., GPUs), and cloud credits. Please contact us at <a href="mailto:[email protected]"><u>[email protected]</u></a> to learn more about sponsorships and benefits.

arena.html

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -3,7 +3,7 @@
33
<head>
44
<meta charset="UTF-8">
55
<meta http-equiv="refresh" content="0; url=http://104.154.215.22:7860/">
6-
<title>Redirecting to Software Arena</title>
6+
<title>Redirecting to SWE Arena</title>
77
<script>
88
window.location.href = "http://104.154.215.22:7860/";
99
</script>
Lines changed: 22 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -3,36 +3,36 @@
33
<head>
44
<meta charset="UTF-8">
55
<meta name="viewport" content="width=device-width, initial-scale=1.0">
6-
<title>Software Arena - Computer Intelligence Project</title>
6+
<title>SWE Arena - Computer Intelligence Project</title>
77
<link rel="stylesheet" href="../styles.css">
88
</head>
99
<body>
1010
<header>
1111
<nav>
1212
[<a href="../index.html">Home</a>]
1313
[<a href="../about.html">About</a>]
14-
[<a href="../arena.html">Software Arena</a>]
14+
[<a href="../arena.html">SWE Arena</a>]
1515
</nav>
1616
</header>
1717

1818
<main>
19-
<h1>Software Arena</h1>
19+
<h1>SWE Arena</h1>
2020
<p>
21-
Software Arena extends Chatbot Arena with powerful code execution capabilities, enabling evaluation of LLM-generated programs across a wide range of outputs - from simple computations to complex visual interfaces.
21+
SWE Arena extends Chatbot Arena with powerful code execution capabilities, enabling evaluation of LLM-generated programs across a wide range of outputs - from simple computations to complex visual interfaces.
2222
</p>
2323

24-
<h2>What is Software Arena?</h2>
24+
<h2>What is SWE Arena?</h2>
2525
<p>
26-
Software Arena introduces a plug-and-play code execution environment for Chatbot Arena. It enables direct evaluation of LLM capabilities in:
26+
SWE Arena introduces a plug-and-play code execution environment for Chatbot Arena. It enables direct evaluation of LLM capabilities in:
2727
</p>
2828
<ul>
2929
<li>General-purpose code execution across multiple languages</li>
3030
<li>Output visualization ranging from text, images, to interactive UIs</li>
3131
</ul>
3232

33-
<h2>Why Software Arena?</h2>
33+
<h2>Why SWE Arena?</h2>
3434
<p>
35-
Software Arena is designed to address the limitations of Chatbot Arena, particularly in terms of precise code evaluation.
35+
SWE Arena is designed to address the limitations of Chatbot Arena, particularly in terms of precise code evaluation.
3636
Human judgement on code generation is not always reliable [<a href="https://arxiv.org/abs/2402.11296"><u>1</u></a>, <a href="https://arxiv.org/abs/2410.03837"><u>2</u></a>], and generally requires non-trivial knowledge of the language and its libraries.
3737
We consider this a significant limitation for the development of advanced AI systems.
3838
</p>
@@ -42,10 +42,10 @@ <h2>Why Software Arena?</h2>
4242
<a href="https://support.anthropic.com/en/articles/9487310-what-are-artifacts-and-how-do-i-use-them"><u>Claude Artifacts</u></a> by Anthropic is one of the first features in this space to let users interact with LLM-generated frontend applications.
4343
<a href="https://v0.dev/"><u>v0</u></a> by Vercel also allows users to ship LLM-generated frontend applications with frontend frameworks.
4444
Based on this, <a href="https://chatbotarena.ai/webdev"><u>WebDev Arena</u></a> by Chatbot Arena and <a href="https://www.llmcodearena.com/"><u>Code Arena</u></a> by Together AI focus on evaluating LLM-generated frontend applications.
45-
<strong>Software Arena aims to extend this capability to a wider range of outputs, not just frontend applications, but also programs that can be run on backend servers and data analysis.</strong>
45+
<strong>SWE Arena aims to extend this capability to a wider range of outputs, not just frontend applications, but also programs that can be run on backend servers and data analysis.</strong>
4646
</p>
4747
<h2>Supported Outputs</h2>
48-
<p>Software Arena can visualize various types of code execution outputs:</p>
48+
<p>SWE Arena can visualize various types of code execution outputs:</p>
4949
<ul>
5050
<li>Documents (Markdown or Plain Text)</li>
5151
<li>Websites (single HTML webpage)</li>
@@ -61,7 +61,7 @@ <h2>Supported Outputs</h2>
6161

6262
<h2>Technical Implementation</h2>
6363
<p>
64-
Software Arena builds upon FastChat, the foundation of Chatbot Arena, providing seamless code execution capabilities. The implementation focuses on:
64+
SWE Arena builds upon FastChat, the foundation of Chatbot Arena, providing seamless code execution capabilities. The implementation focuses on:
6565
</p>
6666
<ul>
6767
<li><strong>Code Execution:</strong> Secure, sandboxed environment using <a href="https://e2b.dev/"><u>E2B</u></a> for executing code in supported language (Python, JavaScript, etc.).</li>
@@ -73,7 +73,7 @@ <h2>Technical Implementation</h2>
7373

7474
<h2>Expected Outcomes</h2>
7575
<p>
76-
Software Arena aims to deliver several key outcomes:
76+
SWE Arena aims to deliver several key outcomes:
7777
</p>
7878
<ul>
7979
<li><strong>Leaderboard:</strong> A dynamic Elo rating system tracking LLM performance in execution-based code generation, providing transparent comparisons across different models.</li>
@@ -83,39 +83,39 @@ <h2>Expected Outcomes</h2>
8383

8484
<h2>Future Plans</h2>
8585
<p>
86-
Software Arena is currently in the early stage of development.
86+
SWE Arena is currently in the early stage of development.
8787
We plan to continuously add more features towards the goal of Computer Intelligence.
8888
</p>
8989
<p>
90-
Meanwhile, we are actively working with <a href="https://lmarena.ai/"><u>Chatbot Arena</u></a> to integrate Software Arena into their platform.
90+
Meanwhile, we are actively working with <a href="https://lmarena.ai/"><u>Chatbot Arena</u></a> to integrate SWE Arena into their platform.
9191
</p>
9292

9393
<h2>Frequently Asked Questions</h2>
9494
<div class="faq-section">
9595
<div class="faq-item">
96-
<div class="faq-question">Why is the code execution process of Software Arena a bit slow?</div>
96+
<div class="faq-question">Why is the code execution process of SWE Arena a bit slow?</div>
9797
<div class="faq-answer">
98-
Before code execution, Software Arena parses the code and installs various packages to ensure the code can be executed.
98+
Before code execution, SWE Arena parses the code and installs various packages to ensure the code can be executed.
9999
This is why the code execution process is a bit slow.
100100
</div>
101101
</div>
102102
<div class="faq-item">
103-
<div class="faq-question">What can not Software Arena do?</div>
103+
<div class="faq-question">What can not SWE Arena do?</div>
104104
<div class="faq-answer">
105-
Currently, Software Arena does not support programming languages other than JavaScript, TypeScript, HTML, and Python.
106-
In addition, Software Arena can not execute code that use desktop-level UIs (e.g., Tkinter, PyQt, etc.) or take user inputs from the keyboard.
105+
Currently, SWE Arena does not support programming languages other than JavaScript, TypeScript, HTML, and Python.
106+
In addition, SWE Arena can not execute code that use desktop-level UIs (e.g., Tkinter, PyQt, etc.) or take user inputs from the keyboard.
107107
</div>
108108
</div>
109109
<div class="faq-item">
110-
<div class="faq-question">How do I know if Software Arena will use my personal identifiable information (PII)?</div>
110+
<div class="faq-question">How do I know if SWE Arena will use my personal identifiable information (PII)?</div>
111111
<div class="faq-answer">
112-
While Software Arena collects the user input, we will redact the PII (e.g., API keys, etc.) by using <a href="https://huggingface.co/bigcode/starpii"><u>StarPII</u></a>, an NER model that trained on a large-scale code dataset that can identify and mask the PII.
112+
While SWE Arena collects the user input, we will redact the PII (e.g., API keys, etc.) by using <a href="https://huggingface.co/bigcode/starpii"><u>StarPII</u></a>, an NER model that trained on a large-scale code dataset that can identify and mask the PII.
113113
</div>
114114
</div>
115115
<div class="faq-item">
116116
<div class="faq-question">Can I contribute to the project?</div>
117117
<div class="faq-answer">
118-
Yes! Software Arena is an open-source project, and we welcome contributions. You can find our repository on GitHub and join our community through email. We appreciate help in various areas including development, testing, and documentation.
118+
Yes! SWE Arena is an open-source project, and we welcome contributions. You can find our repository on GitHub and join our community through email. We appreciate help in various areas including development, testing, and documentation.
119119
</div>
120120
</div>
121121
</div>

index.html

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
<header>
1111
<nav>
1212
[<a href="about.html">About</a>]
13-
[<a href="arena.html">Software Arena</a>]
13+
[<a href="arena.html">SWE Arena</a>]
1414
</nav>
1515
</header>
1616

@@ -36,7 +36,7 @@ <h1>Computer Intelligence Project</h1>
3636
<section>
3737
<h2>Projects</h2>
3838
<ul>
39-
<li>Software Arena: <a href="arena.html"><u>[Project]</u></a>, <a href="blog/software-arena.html"><u>[Blog]</u></a></li>
39+
<li>SWE Arena: <a href="arena.html"><u>[Project]</u></a>, <a href="blog/swe-arena.html"><u>[Blog]</u></a></li>
4040
</ul>
4141
</section>
4242

0 commit comments

Comments
 (0)