Text analysis workflow (Digital Humanities)
What this tutorial does
This tutorial shows how a typical Digital Humanities text analysis workflow is carried out using UCT eResearch services.
It connects data storage, transfer, computation, and sharing into a single workflow.
Before you begin
You should:
- have access to UCT eResearch services (HPC, storage)
- have a text dataset (e.g. a corpus, archival material, or OCR outputs)
- be working with scripts or tools for text processing
Workflow overview
This workflow follows four stages:
- Store and organise your data
- Transfer data to compute
- Run text analysis
- Manage and share outputs
Step 1 — Store and organise your data
Store your raw and working data in a reliable location before analysis.
→ Store and manage research data
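A consistent directory layout makes it easier to keep raw data untouched while you work. The sketch below is one possible convention, not a required structure; the folder names (`raw`, `working`, `outputs`, `scripts`) are illustrative assumptions you can adapt to your project.

```python
from pathlib import Path

# A hypothetical layout convention; adjust the names to suit your project.
LAYOUT = ["raw", "working", "outputs", "scripts"]

def scaffold(project_root):
    """Create a simple raw/working/outputs split so raw data stays untouched."""
    root = Path(project_root)
    for name in LAYOUT:
        (root / name).mkdir(parents=True, exist_ok=True)
    # Return the directories that now exist, for confirmation.
    return sorted(p.name for p in root.iterdir() if p.is_dir())
```

For example, `scaffold("my_corpus_project")` creates the four folders and returns their names; keep untouched source files in `raw` and write all derived files to `working` or `outputs`.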
Step 2 — Transfer data to compute
Move your dataset to the environment where analysis will run.
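Transfers to a cluster are commonly done with `rsync` over SSH. The helper below only assembles the command; the username, hostname, and destination path shown are placeholders, not real UCT endpoints — substitute the details from your own HPC account.

```python
def build_rsync_cmd(src, user, host, dest):
    """Assemble an rsync command for copying a dataset to a remote cluster.

    -a preserves file metadata, -z compresses in transit, and --partial
    lets an interrupted transfer resume. To actually run the transfer,
    pass the result to subprocess.run(cmd, check=True).
    """
    return ["rsync", "-avz", "--partial", str(src), f"{user}@{host}:{dest}"]

# Hypothetical account and host, for illustration only:
cmd = build_rsync_cmd("corpus/", "jsmith", "hpc.example.ac.za", "/scratch/jsmith/corpus/")
```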
Step 3 — Run text analysis
Run your scripts or tools to process and analyse the text corpus.
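As a minimal illustration of what an analysis script might do, the sketch below tokenises a corpus and counts word frequencies using only the standard library. This is a deliberately simple baseline, not a recommended pipeline; real projects typically use dedicated tools such as spaCy or NLTK for tokenisation.

```python
import re
from collections import Counter

def tokenise(text):
    """Lowercase the text and extract runs of letters -- a simple baseline."""
    return re.findall(r"[a-z]+", text.lower())

def word_frequencies(docs):
    """Count token frequencies across a corpus (an iterable of strings)."""
    counts = Counter()
    for doc in docs:
        counts.update(tokenise(doc))
    return counts

# Tiny illustrative corpus:
freqs = word_frequencies(["The cat sat.", "The cat ran."])
```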
Step 4 — Manage and share outputs
Store results, organise outputs, and share with collaborators if needed.
→ Store and share research data
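When sharing outputs, a checksum manifest lets collaborators verify that files arrived intact. The sketch below builds one with SHA-256 hashes; the function name and manifest format are illustrative, not a required convention.

```python
import hashlib
from pathlib import Path

def manifest(output_dir):
    """Map each file under output_dir (relative path) to its SHA-256 digest,
    so collaborators can confirm shared outputs transferred without corruption."""
    result = {}
    for path in sorted(Path(output_dir).rglob("*")):
        if path.is_file():
            result[str(path.relative_to(output_dir))] = (
                hashlib.sha256(path.read_bytes()).hexdigest()
            )
    return result
```

A collaborator can recompute the same manifest on their copy and compare the two dictionaries; any mismatch pinpoints the corrupted or missing file.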
What this workflow looks like in practice
In Digital Humanities projects, this workflow often involves:
- preparing and cleaning text corpora
- iterating on scripts for analysis (e.g. tokenisation, topic modelling)
- running compute-intensive tasks on HPC when datasets are large
- managing multiple versions of datasets and outputs
- sharing results with collaborators or publishing outputs
What you have done
You have:
- organised your dataset for analysis
- moved data to the appropriate compute environment
- run analysis workflows
- managed and shared outputs
Next steps
You can extend this workflow by:
- collaborating on code and analysis workflows
→ Collaborate on code
- improving reproducibility and version control
→ Work with code repositories
- scaling analysis for larger datasets or more complex workflows
→ Run large-scale analysis