feat: Added talks

cowolff · cowolff · commit 2d2e9e5c5677 · 2026-01-15T11:07:19.000+01:00
diff --git a/_pages/seminar_talks/202600.md b/_pages/seminar_talks/202600.md
@@ -0,0 +1,24 @@
+<h4><a href="https://www.daniel-gomm.com/" target="blank">Daniel Gomm</a>, CWI & University of Amsterdam</h4>
+
+23rd January, 4:00pm, room L302, CWI, Amsterdam Science Park, in-person talk and streamed through Zoom
+
+<details><summary>Unfold Bio</summary>
+    <p>
+    Daniel Gomm is a PhD student at the Table Representation Learning Lab within the Database Architectures Group at Centrum Wiskunde & Informatica (CWI) and the Information Retrieval Lab at the University of Amsterdam. His research lies at the intersection of generative AI, information retrieval, and structured data, with the goal of democratizing access to insights from tables, relational databases, and spreadsheets by contextualizing generative methods in structured data settings. He brings an interdisciplinary background spanning engineering, economics, and computer science, with experience across industry, academia, and policy.
+    </p>
+</details>
+<br>
+<b>"Are We Asking the Right Questions? On Ambiguity in Natural Language Queries for Tabular Data Analysis"</b>
+<br>
+
+<p>
+Natural language interfaces for tabular data analysis must contend with ambiguity in user queries. Rather than treating ambiguity as a flaw to be eliminated, this talk argues that ambiguity is often an intentional and productive aspect of user–system interaction.
+<br>
+Daniel presents a principled framework that conceptualizes analytical queries through the lens of cooperative interaction, distinguishing between unambiguous queries, cooperative but underspecified queries that systems can reasonably resolve, and uncooperative queries that lack sufficient information for any actionable interpretation. The framework is grounded in linguistic theory and formalizes how responsibility for query specification is shared between user and system.
+<br>
+Applying this framework, the talk analyzes queries from 15 widely used benchmarks for tabular question answering, text-to-SQL, and data analysis. The results reveal that current datasets conflate different query types, undermining meaningful evaluation of both execution accuracy and interpretation capabilities. The talk concludes with implications for designing more realistic benchmarks and for building tabular data systems that explicitly support cooperative grounding, selective inference, and iterative clarification.
+</p>
+
+<p>
+<b>Join the seminar remotely via Zoom:</b> <a href="https://cwi-nl-zoom.zoom.us/j/86289891036?pwd=5DGnyc3jpaucipnIjpEVa9wdGEISM1.1" target="_blank">https://cwi-nl-zoom.zoom.us/j/86289891036?pwd=5DGnyc3jpaucipnIjpEVa9wdGEISM1.1</a>
+</p>
diff --git a/_pages/seminar_talks/202601.md b/_pages/seminar_talks/202601.md
@@ -0,0 +1,24 @@
+<h4><a href="https://cornelius-wolff.de/" target="blank">Cornelius Wolff</a>, CWI & University of Amsterdam</h4>
+
+23rd January, 4:15pm, room L302, CWI, Amsterdam Science Park, in-person talk and streamed through Zoom
+
+<details><summary>Unfold Bio</summary>
+    <p>
+    Cornelius Wolff is a PhD researcher at the TRL Lab at Centrum Wiskunde & Informatica (CWI) and the University of Amsterdam, supervised by Madelon Hulsebos and Maarten de Rijke. His main focus is the autonomous retrieval of relevant insight from structured data in realistic settings, with an emphasis on scalable and interpretable AI systems for databases and tabular data. He is also interested in text-to-SQL, small and efficient language models and in-context learning beyond natural language.
+    </p>
+</details>
+<br>
+<b>"SQALE: Scaling Text-to-SQL with Realistic Database Schemas"</b>
+<br>
+
+<p>
+Natural language interfaces for databases rely on text-to-SQL models that can translate user questions into executable SQL queries. While recent advances in large language models have significantly improved performance, progress remains constrained by the limited scale, diversity, and realism of available training data.
+<br>
+In this talk, Cornelius presents <i>SQALE</i>, a large-scale semi-synthetic text-to-SQL dataset grounded in real-world database schemas. SQALE comprises over 517,000 validated (question, schema, query) triples built on 135,000+ relational schemas derived from SchemaPile. The dataset is constructed using a principled generation pipeline that combines schema extension, natural language question synthesis, and SQL generation with execution-based validation.
+<br>
+The talk will discuss the design criteria behind SQALE, its statistical properties in comparison to existing benchmarks such as Spider 2.0 and BIRD, and how SQALE enables more realistic training and evaluation of text-to-SQL models.
+</p>
+
+<p>
+<b>Join the seminar remotely via Zoom:</b> <a href="https://cwi-nl-zoom.zoom.us/j/86289891036?pwd=5DGnyc3jpaucipnIjpEVa9wdGEISM1.1" target="_blank">https://cwi-nl-zoom.zoom.us/j/86289891036?pwd=5DGnyc3jpaucipnIjpEVa9wdGEISM1.1</a>
+</p>
diff --git a/_pages/seminar_talks/202602.md b/_pages/seminar_talks/202602.md
@@ -0,0 +1,22 @@
+<h4><a href="https://www.linkedin.com/in/erkan-karabulut/" target="blank">Erkan Karabulut</a>, University of Amsterdam</h4>
+
+23rd January, 4:30pm, room L302, CWI, Amsterdam Science Park, in-person talk and streamed through Zoom
+
+<details><summary>Unfold Bio</summary>
+    <p>
+    Erkan is a PhD student at the INtelligent Data Engineering Lab (INDElab), University of Amsterdam. His research focuses on knowledge discovery and interpretable inference via rule learning on tabular data, including sensor data, with and without structured background knowledge (knowledge graphs) or prior knowledge from foundation models. 
+</p>
+</details>
+<br>
+<b>"Scalable Knowledge Discovery from Tabular Data"</b>
+<br>
+
+<p>
+Discovering patterns from data in human-understandable forms is a valuable task for both knowledge discovery and interpretable inference. A prominent method is Association Rule Mining (ARM), which identifies patterns in the form of logical rules describing relationships between data attributes. Popular ARM methods, however, rely on algorithmic or optimization-based solutions that struggle to scale to high-dimensional datasets (i.e., tables with many columns) without effective search space reduction.
+<br>
+This talk introduces Aerial+, a novel ARM method that leverages neural networks’ ability to handle high-dimensional data to learn a concise set of prominent patterns from tabular datasets. Aerial+ has been evaluated on both digital twin datasets (sensor data enriched with semantics) and on generic tabular datasets, demonstrating its versatility across domains. In addition, Aerial+ can incorporate prior knowledge to enhance discovery: either from knowledge graphs (structured semantic information about a domain) or from tabular foundation models, large pre-trained neural networks that capture table semantics and support diverse downstream tasks.
+</p>
+
+<p>
+<b>Join the seminar remotely via Zoom:</b> <a href="https://cwi-nl-zoom.zoom.us/j/86289891036?pwd=5DGnyc3jpaucipnIjpEVa9wdGEISM1.1" target="_blank">https://cwi-nl-zoom.zoom.us/j/86289891036?pwd=5DGnyc3jpaucipnIjpEVa9wdGEISM1.1</a>
+</p>
diff --git a/_pages/trl-seminar.md b/_pages/trl-seminar.md
@@ -9,6 +9,15 @@ profiles:
   - align: right
     content: seminar_talks/info.md
 
+  - align: right
+    content: seminar_talks/202600.md
+
+  - align: right
+    content: seminar_talks/202601.md
+
+  - align: right
+    content: seminar_talks/202602.md
+
   - align: right
     content: seminar_talks/past-talks.md