So you’re stuck…

Here are some of the most common questions we are asked about Captionista.

If your question isn’t listed here please contact us by one of the methods shown here.

How do I split text into subtitles?

Once you have typed all your text it is time to split it in the Timeline screen.

Step 1

On iPhone: Press Next to get there from the Script editor screen.

On iPad: Tap the Timeline button to switch to timeline mode.

Step 2

Play the video and tap a word in the text on the subtitle card where you want to split into a new subtitle.

Captionista uses the current video playback time when you tap the word to also set the start time for the new subtitle you just created, all in one action!

How do I make the first subtitle start later?

In Timeline mode, play the video and press the little start time button on the subtitle card to set it to the current player time. You just watch and then tap when it should start.

If your new start time means that the subtitle overlaps in time with another subtitle you already timed, it will merge the two automatically for you.

How do I make gaps in time between subtitles?

Gaps between subtitles can actually improve how your subtitles feel, if there is too much time passing between the text for the previous subtitle to stay on screen.

To solve this, while watching the video you tap the end time button on the subtitle card when you want the subtitle to stop showing. This will automatically create a gap consisting of the remaining time until the next subtitle starts.

Is there a version for macOS?

Not yet. We haven’t ruled this out but the needs of professional Mac users would not be well met by the current Captionista timeline UI which is designed for touch.

It’s not just about using a mouse or trackpad — Mac users will need different considerations, like support for very large (long) video files and this changes the nature of the UI and document formats required.

Many other apps are available to deliver automatic subtitles, and the value Captionista adds is in the better subtitling experience you can produce for your viewers and how you can subtitle short videos in a matter of seconds.

Why is there no A.I. Text-to-Speech?

Now this is a great question! Captionista is “opinionated software”. It was designed from the start to provide a great fast experience adding subtitles or captions without any need for A.I.

Quality issues

The best subtitles use good timing and careful splitting of text. This is typically missing if you use A.I. transcription, and it can take a long time to achieve this with conventional subtitling software after using the A.I. to transcribe. This can cancel out the benefits of not having to type most of the text.

Assuming you care about the text matching what is said, Jkust correcting errors in the A.I. transcript can be very time consuming and itself prone to error. It can take as long as it would to type it yourself.

Most A.I. Text-to-Speech subtitling systems try to match timing of spoken text exactly, but this often results in an unnatural reading experience that lacks nuance in delivery. Typically this is handled by using “rolling” subtitles that show a word or line at a time, moving up the screen as new text appears. This is generally a poor reading experience.

Furthermore, A.I. tends to not capitalise or punctuate text properly, often including correctly spelled but nonsensical words which can be hard to see when reviewing, leading to embarrassing mistakes.

Audio issues

A.I. transcription struggles with noisy audio, and very many social videos are recorded in noisy environments or without special microphones to capture a clear audio track. Correcting an A.I. transcription of “misheard” audio, you might as well start from scratch.

Language limitations

We’re an international product. Most A.I. systems cannot transcribe many languages, so manual text entry is superior here.

Network access

Most A.I. Text-to-Speech solutions require network access that slows down your transcription process, uses up mobile data and by definition cannot work when you have poor data connectivity.

Captionista always works, even if you have no internet access at all. You can subtitle in a bunker, on a plane, or in a Faraday Cage if you wish.


We cannot make our own A.I. Text-to-Speech system. Very few companies actually do, and as a result we would have no control over the quality of the results and so a poor product makes our product look worse and we have no way to improve it. We’re not prepared to do that.

Costs & Privacy

Most A.I. solutions require datacenters to process the audio and they are expensive services to use. We’d have to charge you the custom a much higher price and/or charge per minute of transcription like a lot of other subtitling apps do.

Furthermore, any services that don’t charge are likely to be using the audio for other purposes and this can have privacy issues for our users.