Skip to content

Milestones

List view

  • ParlaMint version 5.0 release (done in scope of ParlaCAP project) The complete corpus will be availabe as follows: * http://hdl.handle.net/11356/2004 : Multilingual comparable corpora of parliamentary debates ParlaMint 5.0 * http://hdl.handle.net/11356/2005 : Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 5.0 * http://hdl.handle.net/11356/2006: Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 5.0 As opposed to 4.1, this version: * adds automatically assigned CAP topics to speeches in all corpora * adds automatically assigned sentiment score and labels to sentences in all corpora * adds automatically assigned sentiment score and labels to speeches in the SI corpus * improves some scripts, esp. to better handle parallelisation and processing huge corpora * corrects a few errors found in the 4.1 data, most notably giving corpus-unique IDs to corpus-specific taxonomy IDs.

    Due by July 15, 2025
    9/9 issues closed
  • Post-project maintenance release. The complete corpus is availabe as follows: * http://hdl.handle.net/11356/1912 : Multilingual comparable corpora of parliamentary debates ParlaMint 4.1 * http://hdl.handle.net/11356/1911 : Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.1 * http://hdl.handle.net/11356/1910: Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 4.1 As opposed to 4.0, this version: * corrects various found errors in 4.0 * restructures the ParlaMint GitHub repository * the DK has been linguistically re-annotated to remove various mistakes and its speeches are now also marked with topics * the PT corpus has been extended to 2024-03 * the UA corpus has been extended to 2023-11 and has improved language marking (uk vs. ru) on segments. has improved language marking (uk vs. ru) on segments.

    Due by June 1, 2024
    31/31 issues closed
  • Final ParlaMint II release of corpora MTed to English and semantically annotated (incl. the British corpus), 29 corpora: Reserved handle: - http://hdl.handle.net/11356/1864 : Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 4.0 Base release in original languages: - http://hdl.handle.net/11356/1860 : Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.0 As opposed to ParlaMint-en 3.0, this version: * adds USAS semantic tags to words * adds MTed ES, ES-PV, FI and extends AT, CZ, HU, UA * adds metadata from CHES and Wikipedia (political orientations, ministers) * corrects some problems with encoding, metadata, and downstream conversions

    Due by November 14, 2023
    5/5 issues closed
  • First release of the ParlaMint II project corpora machine translated to English. For each submitted corpus all transcriptions should be present, and metadata largely ok. Reserved handle: * http://hdl.handle.net/11356/1810: Linguistically annotated multilingual comparable corpora of parliamentary debates in English ParlaMint-en.ana 3.0

    Due by August 12, 2023
    1/1 issues closed
  • Final ParlaMint II release. Reserved handles: * http://hdl.handle.net/11356/1859 : Multilingual comparable corpora of parliamentary debates ParlaMint 4.0 * http://hdl.handle.net/11356/1860 : Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 4.0 As opposed to 3.0, this version: * adds ES, FI and extends AT, CZ, HU, UA * adds metadata from [CHES](https://www.chesdata.eu/) and Wikipedia ([political orientations](https://en.wikipedia.org/wiki/Left%E2%80%93right_political_spectrum), [ministers](https://en.wikipedia.org/wiki/Minister_(government))) * corrects many problems with encoding, metadata, and downstream conversions.

    Due by October 23, 2023
    58/58 issues closed
  • First release of the ParlaMint II project corpora. For each submitted corpus all transcriptions should be present, and metadata largely ok. Reserved handles: * http://hdl.handle.net/11356/1486 : Multilingual comparable corpora of parliamentary debates ParlaMint 3.0 * http://hdl.handle.net/11356/1488 : Linguistically annotated multilingual comparable corpora of parliamentary debates ParlaMint.ana 3.0

    Due by July 1, 2023
    17/17 issues closed
  • Due by May 31, 2021
    4/4 issues closed
  • Due by May 14, 2021
    50/50 issues closed