@inproceedings{savy-etal-2006-multilevel,
    title = "Multilevel corpus analysis: generating and querying an {AG}set of spoken {I}talian ({S}p{I}t-{MD}b).",
    author = "Savy, Renata  and
      Cutugno, Francesco  and
      Crocco, Claudia",
    editor = "Calzolari, Nicoletta  and
      Choukri, Khalid  and
      Gangemi, Aldo  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Odijk, Jan  and
      Tapias, Daniel",
    booktitle = "Proceedings of the Fifth International Conference on Language Resources and Evaluation ({LREC}{'}06)",
    month = may,
    year = "2006",
    address = "Genoa, Italy",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://preview.aclanthology.org/ingest-emnlp/L06-1295/",
    abstract = "In this paper we present an application of AGTK to a corpus of spoken Italian annotated at many different linguistic levels. The work consists of two parts: a) the presentation of AG-SpIt, a toolkit devoted to corpus data management that we developed according to AGTK proposals; b) the presentation of corpus structure together with some examples and results of cross-level linguistic analyses obtained querying the database (SpIt-MDb). As this work is still an ongoing investigation, results must be considered preliminary, as a demo illustrating the potentiality of the tool and the advantages it introduces to validate linguistic theories and annotation systems. Currently, SpIt-MDb is a linguistic resource under development; it represents one of the first attempts to create an Italian corpus labelled at various linguistic levels (from acoustic/sub-phonetic, to textual/pragmatic ones) which can be queried in the interrelations among levels."
}Markdown (Informal)
[Multilevel corpus analysis: generating and querying an AGset of spoken Italian (SpIt-MDb).](https://preview.aclanthology.org/ingest-emnlp/L06-1295/) (Savy et al., LREC 2006)
ACL