Personal Information

Name: Ignacio Garcia del Campo
DOB: November 10th, 1981
Nationality: Spanish (H-1B U.S. Visa)
Email Address: igc.csmail [at] gmail [dot] com
Website: http://www.igc-web.com

Education

Master's Degree in Computer Science, 2006

Old Dominion University, Norfolk, VA
Coursework concentration in Databases and Digital Libraries
MS Project Title: "Website Mirroring and Harvesting Tools - A Comparison"
GPA: 3.90

Bachelor of Science in Computer Science, 2004

Old Dominion University, Norfolk, VA
Minor: Information Technology
GPA: 3.27
Relevant Coursework, Honors and Awards:
  • Named three times to the Dean's List
  • Professional Workforce Development Courses

Work Experience

CACI Int. - Software Developer (Level II)

Creation, Maintenance, and Dissemination of XML-based Format Sustainability Assessment Documents [Feb.'07 to ...]

  • Supervisors: Caroline Arms (LOC) and Donald Emerson (CACI)
  • Principal Developer
  • http://www.digitalpreservation.gov/formats
  • The Library of Congress wishes to take the growing collection of format description documents (FDDs) currently maintained in HTML and establish a workflow for the creation, maintenance, and dissemination of FDDs using XML Schema markup as the master encoding.
    The aim of the project is to develop an XML-based representation for FDDs that will support both efficient creation of FDDs and the current Web Site functionality. Based on this XML Schema, transformations from XML to both HTML and PDF will be developed to assist the Library of Congress in evaluating the integration of newly created FDDs into the Web Site and their compliance with Library policies.

Web Archive Tools [Feb.'07 to Sept.'07]

  • Supervisors: Laura Graham (LOC) and Michael Ivey (CACI)
  • Principal Developer
  • The aim of this project is to set up and test a suite of open-source tools, identify their requirements, and make recommendations for deploying them within the Library of Congress technical environment. The tools involved in the project are The Wayback Machine, NutchWax, Hadoop and the 20th Century Search.

Web Archiving and Retrieval Appliance (WARA) [March'07 and April'07]

  • Supervisor: Donald Emerson (CACI)
  • Principal Developer, in charge of creating the appliance and testing it.
  • WARA is a VMware appliance for web archiving and retrieval, featuring Apache Tomcat, Wayback, and Heritrix running on Ubuntu.
    WARA bundles several of the leading web harvest, capture, and preservation software components from the Internet Archive into one easy-to-use appliance.

Graduate Research Assistant

Library of Congress: Harvest Streaming Media With Heritrix and Retriever Tools [Fall'06]

  • Supervisors: Michael Ashenfelder (LOC) and Dr. Michael Nelson (ODU)
  • Principal Developer, Perl code.
  • The aim of this project is to integrate the Web harvesting tool Heritrix with other retrieval software tools (MPlayer). The retrieval tool has the potential to download a variety of files that Heritrix currently has difficulty retrieving. In this phase of the project the focus is on downloading audio/video files, or streaming media. The tool will be responsible for listing and downloading all audio/video files that Heritrix did not retrieve, and for packing them into ARC files in the same format used by Heritrix.

Old Dominion University: mod_oai [Spring'06 and Summer'06]

  • Supervisor: Dr. Michael Nelson (ODU)
  • Principal Developer, in charge of creating cross-compatibility with Mac OS X and extending the module's functionality. C# code.
  • http://www.modoai.org
  • The aim of this project is to create the mod_oai Apache software module, which will expose content accessible from Apache Web servers via the Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH). The mod_oai project (Old Dominion University/Los Alamos National Laboratory) is funded by the Andrew W. Mellon Foundation. The Apache Web server defines an extensible module format that allows specific functionality to be incorporated directly into the Web server. The mod_oai module is able to respond to OAI-PMH requests pertaining to files made accessible by the Apache server.

Old Dominion University: Teaching Assistant, CS 350 [Fall'04 to Fall'05]

Technical Skills

Development Environments: Linux/Unix, Windows, Mac OS X
Programming Languages: C#, C++, Perl 5, PHP, Java, PL/SQL
Web Related: XHTML, XML, CSS, JavaScript, Ajax, Flash
Software Applications: VMware, Apache Tomcat, Paint Shop Pro, NSIS, Heritrix, Wayback

Communication Skills

Native Language: Spanish

Second Language (fluent): English

Worked on individual and group projects, including presentations

Publications

Efficient, Automatic Web Resource Harvesting

Michael L. Nelson, Joan A. Smith, Ignacio Garcia del Campo, Herbert Van de Sompel, Xiaoming Liu

Proceedings of WIDM 2006, pp. 43-50

Study of Website Mirroring and Harvesting Tools

JCDL 2007, submitted for publication

Harvest Streaming Media with Heritrix and MPlayer

JCDL 2007, submitted for publication