Thanks a lot for the feedback! Really appreciate you trying it out. For longer videos, transcription can definitely take a while. I'm working on moving processing to more powerful servers (it's currently running off my home machine).
For a quick overview of how it works and to get past those initial steps, this demo video of ragsplain might help:
https://youtu.be/BwuA_e7Xn74