Petr Sgall

Also published as: P. Sgall


2012

We introduce a substantial update of the Prague Czech-English Dependency Treebank, a parallel corpus manually annotated at the deep syntactic layer of linguistic representation. The English part consists of the Wall Street Journal (WSJ) section of the Penn Treebank. The Czech part was translated from the English source sentence by sentence. This paper gives a high-level overview of the underlying linguistic theory (the so-called tectogrammatical annotation), with some details of its most important features, such as valency annotation, ellipsis reconstruction, and coreference.

2006

In the present contribution we claim that corpus annotation serves, among other things, as an invaluable test for the linguistic theories behind annotation schemes, and as such represents an irreplaceable source of linguistic information for building grammars. To support this claim we present four linguistic phenomena whose study and adequate description in grammar have gained important observations from the deep layer of corpus annotation introduced in the Prague Dependency Treebank: the information structure of the sentence, the condition of projectivity and word order, types of dependency relations, and textual coreference.

2000

After a brief characterization of the theory of the topic-focus articulation of the sentence (TFA), rules are formulated that determine the assignment of appropriate values of the TFA attribute in the process of syntactico-semantic tagging of a very large corpus of Czech.
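
The abstract does not reproduce the rules themselves; the following Python sketch only illustrates the general idea of assigning TFA values during tagging. The Node class, the value inventory (t for topic, c for contrastive topic, f for focus), and the purely position-based heuristic are simplifying assumptions made for this illustration; the actual rules are far richer and consult further cues such as word order and the placement of the intonation centre.

# Minimal illustrative sketch (not the published rule set): assign hypothetical
# TFA values 't' (topic), 'c' (contrastive topic) and 'f' (focus) to the nodes
# of a sentence, using only surface position relative to the main verb.

from dataclasses import dataclass
from typing import Dict, List

@dataclass
class Node:
    form: str                   # surface word form
    order: int                  # surface word-order position
    is_verb: bool = False
    contrastive: bool = False   # e.g. marked by a contrastive particle

def assign_tfa(nodes: List[Node]) -> Dict[str, str]:
    """Return a mapping from word form to a TFA value."""
    verb_pos = next(n.order for n in nodes if n.is_verb)
    tfa = {}
    for n in nodes:
        if n.is_verb:
            tfa[n.form] = "f"                               # verb counted as focal here
        elif n.order < verb_pos:
            tfa[n.form] = "c" if n.contrastive else "t"     # preverbal -> topic
        else:
            tfa[n.form] = "f"                               # postverbal -> focus
    return tfa

# Toy Czech sentence: "Jan vcera koupil knihu" (Jan bought a book yesterday)
sentence = [
    Node("Jan", 1), Node("vcera", 2),
    Node("koupil", 3, is_verb=True), Node("knihu", 4),
]
print(assign_tfa(sentence))   # {'Jan': 't', 'vcera': 't', 'koupil': 'f', 'knihu': 'f'}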

1995

The dichotomy of topic and focus, based in the Praguean Functional Generative Description on the scale of communicative dynamism, is relevant not only for the possible placement of the sentence in a context, but also for its semantic interpretation. Automatic identification of topic and focus may use input information on word order, on the systemic ordering of kinds of complementations (reflected by the underlying order of the items included in the focus), on definiteness, and on the lexical semantic properties of words. An algorithm for the analysis of English sentences has been implemented; it is discussed and illustrated with several examples.
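
As a rough illustration of the kind of input mentioned here (word order, systemic ordering of complementations, definiteness), the following Python sketch shows one possible topic-focus split. The functor labels, the toy systemic ordering, and the split_topic_focus heuristic are assumptions made for the example, not the implemented algorithm.

# Simplified sketch: split a verb's complementations into topic and focus
# using surface position, definiteness, and a fixed "systemic ordering"
# of complementation types (a toy subset, for illustration only).

SYSTEMIC_ORDERING = ["ACT", "TWHEN", "LOC", "ADDR", "PAT", "DIR"]

def split_topic_focus(complementations, verb_position):
    """complementations: list of dicts with 'functor', 'position', 'definite'."""
    topic, focus = [], []
    for c in complementations:
        if c["position"] < verb_position:
            # preverbal definite items are prototypically contextually bound
            (topic if c["definite"] else focus).append(c)
        else:
            focus.append(c)
    # within the focus, items follow the underlying (systemic) ordering
    focus.sort(key=lambda c: SYSTEMIC_ORDERING.index(c["functor"]))
    return topic, focus

# "John met a girl in the park yesterday."
comps = [
    {"functor": "ACT",   "position": 1, "definite": True},   # John
    {"functor": "PAT",   "position": 3, "definite": False},  # a girl
    {"functor": "LOC",   "position": 4, "definite": True},   # in the park
    {"functor": "TWHEN", "position": 5, "definite": False},  # yesterday
]
topic, focus = split_topic_focus(comps, verb_position=2)
print([c["functor"] for c in topic])   # ['ACT']
print([c["functor"] for c in focus])   # ['TWHEN', 'LOC', 'PAT']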

1993

An algorithm for automatic identification of the topic and focus of a sentence is presented; it is based on dependency syntax and uses written input, which is much more ambiguous than a spoken utterance.

1983

In the present paper we characterize in more detail some aspects of a question answering system that takes as its starting point the underlying structure of sentences (which, in some approaches, can be identified with the level of meaning or of logical form). First, the criteria used to identify the elementary units of the underlying structure and the operations conjoining them into complex units are described (Sect. 1); then the main types of units and operations resulting from an empirical investigation based on these criteria are listed (Sect. 2); finally, the rules of inference, accounting for the relevant aspects of the relationship between linguistic and cognitive structures, are illustrated (Sect. 3).
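
To make the role of such inference rules concrete, here is a hedged Python sketch, not the system described in the paper: propositions derived from underlying sentence structures are stored as triples, and a small set of rules closes the knowledge base before a question is matched against it. The predicates part_of and in and both rules are invented for the example.

# Schematic illustration of inference over a triple store.

def infer(facts):
    """Apply the inference rules until no new triples are produced."""
    rules = [
        # location is transitive: (x in y) & (y in z) => (x in z)
        lambda fs: {(a, "in", d) for (a, r1, b) in fs for (c, r2, d) in fs
                    if r1 == r2 == "in" and b == c},
        # a part of something located somewhere is located there, too
        lambda fs: {(a, "in", d) for (a, r1, b) in fs for (c, r2, d) in fs
                    if r1 == "part_of" and r2 == "in" and b == c},
    ]
    closed = set(facts)
    while True:
        new = set().union(*(rule(closed) for rule in rules)) - closed
        if not new:
            return closed
        closed |= new

facts = {
    ("tower", "part_of", "castle"),
    ("castle", "in", "Prague"),
    ("Prague", "in", "Bohemia"),
}
kb = infer(facts)
# Answering "Where is the tower?" now finds ('tower', 'in', 'Prague')
# and ('tower', 'in', 'Bohemia') in the closed knowledge base.
print(sorted(t for t in kb if t[0] == "tower" and t[1] == "in"))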

1980

The necessity of, and means for, distinguishing between a level of linguistic meaning and a domain of "factual knowledge" (or cognitive content) are argued for, supported by a survey of relevant operational criteria. The level of meaning is characterized as a safe basis for computational applications, allowing for a set of inference rules that account for the content (factual relations) of a given domain.
