Syllabus: in HTML | in XML | in XML + XSLT
Lectures | Tools | Projects | Assignments | Final
These are useful documents worth browsing. They may or may not show up in the syllabus.
We'll discuss these in detail starting in class #4. The projects are meant to take about six weeks and should be chosen from the list below. I'm open to suggestions if there's something on this list that you'd really like to work on, and there are a couple of other possibilities that may shape up before class #4, when I'll ask you to commit to one.
In my day job at Lulu, I'm working on a project to enable the creation of textbooks and teaching materials using a free online collaborative process. If you're interested, you can contribute by writing a chapter for an XML book. The project will allow both free online browsing, and the assembly and printing of full books from sections available on the site. The printed books will be purchasable through Lulu. If you do this, you will have to agree to license your writing under a Creative Commons Attribution license. Some ideas for sections are listed below:
[Aaron Brubaker]
[Carl Harris]
[Daniel Lucas]
[Janhavi Sheode]
[Bendte Fagge]
We have a huge repository of XML documents on campus (Documenting the American South) that are marked up using the TEI guidelines. These are transformed into HTML for web presentation, but a toolset for transforming them into PDF would be nice too. There are XSLT stylesheets for producing XSL Formatting Objects written for generic TEI, and XSL-FO documents can be turned into PDF using free tools like FOP.
[Sean Chen]
How do you search an XML repository? DocSouth could use its own full text search capability (right now they use Google). This project might involve an investigation of the possibilities with the goal being a recommendation, or it could be an experimental implementation using a free search engine tool like Lucene. Or both.
If we have anyone who's also taking Stephanie Haas's NLP class, there is the possibility of double-dipping, i.e., working on a project that straddles both classes. This might involve developing XML Schemas for customizing Named Entity Recognition using GATE.
[Sara Gault]
[Marcos Rodriguez]
I've been involved with a project called EpiDoc for some years now that aims to provide guidelines for marking up ancient inscriptions using TEI XML. The guidelines will use a combination of Schematron and XSLT 2.0 to "unit test" entries. The core technology is in place, but the work of adding test rules to the individual guideline pages has stalled. You can help apply bleeding edge technology to the study of the ancient world!
[Tim Baldwin]
[Ric Simmons]
The Pleiades project is an initiative coming out of the Ancient World Mapping Center at UNC to develop online mapping tools for the Ancient World. They need access to new publications in areas they are interested in. Tom Elliott, the lead developer of the project sends this request:
We would use this to feed bibliographic data into Pleiades for subsequent use, refinement, etc. by collaborators. We would also expect to expose the feeds perpetually for the use of others.
All assignments are to be delivered via email and are due before the following class.
The final is due by 5:00 pm on December the 14th. It consists of two parts. You should plan to spend less than 3 hours to complete it. There isn't a strict time limit, but if you're taking longer than that, then you're doing too much work. The exam is open book/internet/etc. You are expected to deploy all the means available to you to complete the test, with the exception that, as usual, you are not allowed to give nor receive help on the test. The exam should be delivered to me via email before the due date, and you will receive confirmation of its receipt from me. If you don't receive confirmation within a reasonable amount of time, please follow up with me to make certain I have received your exam.