Here is an example of text processing. We start with the most famous novel in English. Here is a scan of the first page of Chapter 1 from my bound copy:
[Image: scan of the first page of Chapter 1]
I love the layout. I got the text from Project Gutenberg (previous link) and stripped out some header text. We use Python to open the file and count each combination of two letters. All 26 × 26 possibilities are stored in a dictionary, like this:
[Image: the dictionary of two-letter counts]
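The counting step can be sketched roughly as follows. This is not the original script, just a minimal version under a few assumptions: case is ignored, only the letters a to z count, and a pair is counted only when the two letters sit directly next to each other (so pairs never span a space or punctuation mark). The function name `count_bigrams` and the sample sentence are made up for illustration.

```python
from itertools import product
from string import ascii_lowercase


def count_bigrams(text):
    """Count adjacent letter pairs in text, ignoring case.

    Assumption: a pair is counted only if both characters are in a-z,
    so pairs never span spaces, digits, or punctuation.
    """
    # Start with all 26 x 26 possible pairs at zero, so that pairs
    # which never occur still show up in the result.
    counts = {a + b: 0 for a, b in product(ascii_lowercase, repeat=2)}
    text = text.lower()
    for first, second in zip(text, text[1:]):
        pair = first + second
        if pair in counts:
            counts[pair] += 1
    return counts


# Toy input standing in for the novel's text.
counts = count_bigrams("The cat sat on the mat.")
print(len(counts))   # number of dictionary entries
print(counts["at"])  # occurrences of "at"
```

For the real text one would read the Project Gutenberg file into a string with `open(...).read()` and pass it to the same function.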
After writing the results to disk, we open them in R and plot. The combined code weighs in at a hefty 70 lines or so. The plot looks pretty impressive. The size of each circle is proportional to the logarithm of the count.
[Image: plot of the two-letter counts, with circle size proportional to the logarithm of the count]
Update: I realize now that this might not be exactly what you want. One might instead be interested in how the frequency of the second letter differs, given the first letter. For that, we need to normalize by the frequency of the first letter.
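One way to do that normalization, sketched here as an assumption rather than as what the plotting code actually does, is to divide each pair count by the total count of all pairs that share the same first letter. The result is the fraction of times a given second letter follows that first letter. The function name `conditional_frequencies` and the toy counts are invented for the example.

```python
from collections import defaultdict


def conditional_frequencies(counts):
    """Convert raw pair counts into the frequency of the second letter
    given the first: count(ab) / sum of count(ax) over all x."""
    totals = defaultdict(int)
    for pair, n in counts.items():
        totals[pair[0]] += n
    # Skip first letters that never occur, to avoid dividing by zero.
    return {
        pair: n / totals[pair[0]]
        for pair, n in counts.items()
        if totals[pair[0]] > 0
    }


# Made-up counts, not taken from the novel.
toy_counts = {"th": 6, "te": 2, "to": 2, "qu": 5, "zz": 0}
freqs = conditional_frequencies(toy_counts)
print(freqs["th"])  # share of "t" pairs that are "th"
print(freqs["qu"])  # "q" is always followed by "u" in the toy data
```

After this step the values for each first letter sum to 1, so a rare first letter like "q" is no longer swamped by common ones like "t".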