To: seminaire@pauillac.inria.fr From: Didier.Remy@inria.fr Subject: SEM - INRIA : Cristal - 14/03/03 - Paris - FR Vous pouvez maintenant vous abonner à nos annonces de séminaires http://pauillac.inria.fr/seminaires/subscribe.html S E M I N A I R E . ___ / _ _ / _ / / / \ / \ / / __| / |___ |_/ |_/ / |__ |_/ |_ ___ . / / ___ __ /_ _ / _/ /| /| _ __ __ _ _ / / / /_ / __| / / |/ | / \ /_ / / \ | / __| |___ / / __/ |_ |_/ |_ / | |_/__/ |_ |_/ |/ |_/ I N R I A - Rocquencourt Salle de conference du Alan Turing, Bat 1 Vendredi 14 mars, 10h30 ------------ Haruo Hosoya ------------ RIMS, Kyoto University. ================================== Principles of typed XML processing ================================== XML is a standard exchange format for tree structure data and quickly became popular recently. XML's main strength is its ability to give structural constraints on data by using types. In this talk, I overview the series of work done in the XDuce project for designing a statically typed language for XML processing. The key technique here is to use tree language theory as the basis. Specifically, we adopt tree regular expressions as types, extend these to a pattern matching facility, and perform typechecking by operations on them. Necessary algorithmic problems are then solved by converting types to finite tree automata. In particular, we can thus use the language inclusion between tree regular expressions as subtyping, which is literally the most powerful since it allows all subtype relations that are semantically sound. We also use boolean operations such as intersection and difference for analyses on pattern matches. The resulting type system is simple, elegant, and expressive and thereby should be easy for the user to understand. I also mention our recent efforts for incorporating the attribute typing mechanism of RELAX NG to our type system. This work is an outcome from my collaboration with Benjamin Pierce, Jerome Vouillon, and Makoto Murata.