Smart Qualitative Data: Methods and Community Tools for Data Mark-Up
SQUAD was a demonstrator project that explored methodological and technical solutions for exposing digital qualitative data to make them fully shareable, exploitable and archivable for the longer term. Describing context, XML standards and text mining tools for anonymising data were used.
Initially, the project dealt with specifying and testing
flexible means of storing and marking-up, or annotating,
qualitative data using universal standards and technologies,
through eXtensible Mark-up Language (XML). Such tools are
required to exploit fully the potential of qualitative data for
adventurous collaborative research using web-based and e-science
systems. An example of the latter might be linking multiple
data and information sources, such as text, statistics and maps. A
community standard, or schema, was proposed that would be
applicable to most kinds of qualitative data which might be able to
function as a longer-term preservation format.
The second strand investigated optimal requirements for describing or 'contextualising' research data (e.g. interview setting or interviewer characteristics), aiming to develop standards for data documentation and ways of capturing this information.
The third strand experimented with natural language processing technologies to develop and implement user-friendly tools for semi-automating processes to prepare marked-up qualitative data.
The project furthered research tools for publishing (e.g. for web interrogation) and archiving enriched marked-up data and associated research materials.
Staff from ESDS Qualidata at the UK Data Archive led the project. They provided the qualitative data mark-up schema, defined needs, provided sample qualitative data and manually created XML marked-up text. Claire Grover's team at the Human Communication Research Centre in Edinburgh were responsible for developing the natural language processing toolsets, including automated XML mark-up and friendly JAVA interfaces to the mark-up tools. Both sites contributed to user testing, evaluation and documentation activities.
Dates: March 2005 - August 2006
Contact: Louise Corti
Links: Final report to ESRC
Searching and sharing qualitative data: the uses of XML