Personal Information
| Name: |
Ignacio Garcia del Campo |
| DOB: |
November 10th, 1981 |
| Nationality: |
Spain (H-1B U.S. Visa) |
| Email Address: |
igc.csmail [at] gmail [dot] com |
| Website: |
http://www.igc-web.com |
Education
Masters Degree in Computer Science, 2006
Old Dominion University, Norfolk, VA
Coursework concentration in Databases and Digital Libraries
MS Project Title: "Website Mirroring and Harvesting Tools - A Comparison"
GPA: 3.90
Bachelor of Science in Computer Science, 2004
Old Dominion University, Norfolk, VA
Minor: Information Technology
GPA: 3.27
Relevant Coursework, Honors and Awards:
- Named three times to the Dean's List
- Professional Workforce Development Courses
Work Experience
CACI Int. - Software Developer (Level II)
Creation, Maintenance, and Dissemination of XML-based Format Sustainability Assessment Documents [Feb.'07 to ...]
- Supervisor: Caroline Arms (LOC) and Donald Emerson (CACI)
- Principal Developer
- http://www.digitalpreservation.gov/formats
- The Library of Congress whishes to take the growing collection of format description documents (FDDs)
currently maintained in HTML and establish a workflow for creation, maintenance and dissemination of FDDs using XHML Schema markup as the master encoding.
The aim of the projet is to develop and XML-based representation for FDDs that will support both efficient creation of FDDs and the current Web Site functionality. Based
on this XML Schema, transformations from XML to both HTML and PDF will be developed to assist the Library of Congress
in evaluating the integration of newly created FDDs into the Web Site and compliance with Library policies.
Web Archive Tools [Feb.'07 to Sept.'07]
- Supervisor: Laura Graham (LOC) and Michael Ivey (CACI)
- Principal Developer
- The aim of this project is to set up, test, identify the requirements and make recommendations for deploying a suite of open source tools within the
Library of Congress technical environment. The tools involved in the project are The Wayback Machine,
NutchWax, Hadoop and the 20th Century Search.
Web Archiving and Retrieval Appliance (WARA) [March'07 and April'07]
- Supervisor: Donald Emerson (CACI)
- Principal Developer, in charge of creating the appliance and testing it.
- WARA is a VMWare Appliance solution for web archiving and retrieval featuring Apache Tomcat, Wayback and Heritrix over Ubuntu OS.
WARA offers several of the leading web harvest, capture, and preservation software components from the Internet Archive, conveniently rolled up into one easy to use appliance.
Graduate Research Assistant
Library of Congress: Harvest Streaming Media With Heritrix and Retriever Tools [Fall'06]
- Supervisor: Michael Ashenfelder (LOC) and
Dr. Michael Nelson (ODU)
- Principal Developer, Perl code.
- The aim of this project is to integrate the Web harvesting tool Heritrix with other retrieval software tools (Mplayer).
The retrieval tool has the potential to download a variety of files that Heritrix currently has difficulty getting.
In this phase of the project the focus is on downloading audio/video files, or streaming media. The tool will be responsible for
listing and downloading all audio/video files that Heritrix did not get, and pack them into ARC files with the same format used by Heritrix.
- Supervisor: Dr. Michael Nelson (ODU)
- Principal Developer, in charge of creating cross-compatibility with Mac OSX and extending the module's functionality. C# code.
- http://www.modoai.org
- The aim of this project is to create the mod_oai Apache software module that will expose content accessible from Apache Web servers,
via the Open Archive Initiative Protocol for Metadata Harvesting (OAI-PMH). The mod_oai project (Old Dominion University/Los Alamos National Laboratories)
is funded by the Andrew W. Mellon Foundation. The Apache Web server defines an extensible module format that allows specific functionality to be incorporated
directly into the Web server. The mod_oai module is able to respond to OAI-PMH requests pertaining to files made accessible by the Apache server.
Technical Skills
| Development Enviroments |
Programming Languages |
Web Related |
Software Applications |
Linux/Unix
Windows
Mac OSX
|
C#
C++
Perl 5
PHP
Java
PLSQL
|
XHTML
XML
CSS
JavaScript
Ajax
Flash
|
VMWare
Apache Tomcat
Paint Shop Pro
NSIS
Heritrix
Wayback
|
Communication Skills
Spanish (native)
Second Language (fluent): English
Worked in individual and group projects with presentations
Publications
Efficient, Automatic Web Resource Harvesting
Michael L. Nelson, Joan A. Smight, Ignacio Garcia del Campo, Herbert Van de Sompel, Xiaoming Liu
Proceedings of WIDM 2006, pp. 43-50
Study of Website Mirroring and Harvsting Tools
JCDL 2007, submitted for publication
Harvest Streaming Media with Heritrix and MPlayer
JCDL 2007, submitted for publication