I’ve spent the last ten years trying to feed technologies and insights from Linguistics and Computational Linguistics into the infrastructure of the Web. In this talk I’ll give brief but intense introductions to four areas of research interest from (C)L and related disciplines which have the potential for making a real impact on the way the Web works. Dependent on who’s there, we may dive deeper into one or more of them, time permitting:
- A novel declarative approach to fixup of broken XML/(X)HTML
- Counter-augmented Finite-State Automata for parsing XML
- Functional XML – Self-describing documents meet the lambda calculus
- Identity, URIs and the (Semantic) Web
See http://www.ltg.ed.ac.uk/~ht/msr_20070928.html for an extended abstract.