Popularity

1.8

Stable

Activity

4.7

Stars 52

Watchers 5

Forks 3

Last Commit over 2 years ago

Description

Spokestack is an all-in-one solution for mobile voice interfaces on Android. It provides every piece of the speech processing puzzle, including voice activity detection, wakeword detection, speech recognition, natural language understanding (NLU), and speech synthesis (TTS). Under its default configuration (on newer Android devices), everything except TTS happens directly on the mobile device—no communication with the cloud means faster results and better privacy.

And Android isn't the only platform it supports!

Programming language: Java

License: Apache License 2.0

Tags: Android App Java Android-library

Spokestack alternatives and similar packages

Based on the "App" category.
Alternatively, view spokestack-android alternatives based on common mentions on social networks and blogs.

TextSecure

9.9 9.9 L2 Spokestack VS TextSecure

A private messenger for Android.
BlackHole

9.6 9.4 Spokestack VS BlackHole

DISCONTINUED. A Music Player App made with Flutter

WorkOS - The modern identity platform for B2B SaaS

The APIs are flexible and easy-to-use, supporting authentication, user identity, and complex enterprise features like SSO and SCIM provisioning.

Promo workos.com

wechat

9.5 0.0 L1 Spokestack VS wechat

A High Copy WeChat ,SNS APP (高仿微信)
HomeMirror

9.5 0.0 L3 Spokestack VS HomeMirror

Android application powering the mirror in my house
uhabits

9.4 8.4 L3 Spokestack VS uhabits

Loop Habit Tracker, a mobile app for creating and maintaining long-term positive habits
ViMusic

9.4 0.0 Spokestack VS ViMusic

An Android application for streaming music from YouTube Music.
InstaMaterial

9.4 0.0 L5 Spokestack VS InstaMaterial

Implementation of Instagram with Material Design (originally based on Emmanuel Pacamalan's concept)
#<Sawyer::Resource:0x00007fe2aa53a5b8>

9.4 9.9 Spokestack VS #<Sawyer::Resource:0x00007fe2aa53a5b8>

An alternative frontend for YouTube, for Android.
AmazeFileManager

9.3 9.5 L1 Spokestack VS AmazeFileManager

Material design file manager for Android
MvRx

9.2 6.3 Spokestack VS MvRx

Mavericks: Android on Autopilot
WordPress-Android

9.0 10.0 L2 Spokestack VS WordPress-Android

WordPress for Android
FlyRefresh

8.7 0.0 L5 Spokestack VS FlyRefresh

The implementation of https://dribbble.com/shots/2067564-Replace
Lightning Browser

8.7 2.6 L1 Spokestack VS Lightning Browser

A lightweight Android browser with modern navigation
ForkHub

8.5 0.0 Spokestack VS ForkHub

GitHub client for Android based on the abandoned official app
Twidere-Android

8.5 0.0 L2 Spokestack VS Twidere-Android

Twidere is a powerful twitter client for Android 1.6+ 1 , which gives you a full Holo experience and nearly full Twitter's feature.
Telecine

8.5 1.5 L4 Spokestack VS Telecine

DISCONTINUED. Record full-resolution video on your Android devices.
MaterialAudiobookPlayer

8.2 9.7 Spokestack VS MaterialAudiobookPlayer

Minimalistic audiobook player
Foodium 🍲

8.2 0.0 Spokestack VS Foodium 🍲

🍲Foodium is a sample food blog Android application 📱 built to demonstrate the use of Modern Android development tools - (Kotlin, Coroutines, Flow, Dagger 2/Hilt, Architecture Components, MVVM, Room, Retrofit, Moshi, Material Components).
Bandhook-Kotlin

8.1 0.0 Spokestack VS Bandhook-Kotlin

A showcase music app for Android entirely written using Kotlin language
2048-android

8.1 0.0 L3 Spokestack VS 2048-android

The android port of the 2048 game (for offline playing)
jianshi

8.1 0.0 L4 Spokestack VS jianshi

A Full-Stack mobile app, including Android & Server, Simple-Poem 简诗. You can write poem in graceful & traditional Chinese style.
Etar Calendar

8.0 9.2 L1 Spokestack VS Etar Calendar

Android open source calendar
keepassdroid

7.7 2.9 L2 Spokestack VS keepassdroid

KeePass implementation for android
droidplanner

7.5 0.0 L3 Spokestack VS droidplanner

Ground Control Station for Android Devices
FeedEx

7.2 9.0 L2 Spokestack VS FeedEx

DISCONTINUED. Flym News Reader is a light Android feed reader (RSS/Atom)
Bourbon

6.8 0.0 L5 Spokestack VS Bourbon

An MVP Dribbble client for Android Mobile, Tablet, Wear and TV.
News-Android-App

6.8 9.4 L4 Spokestack VS News-Android-App

📱🗞️ Android client for the Nextcloud news/feed reader app
seadroid

6.5 4.4 L2 Spokestack VS seadroid

Android client for Seafile
DMPlayer

6.3 0.0 L2 Spokestack VS DMPlayer

DMPLayer is an Android music player prototype
clean-status-bar

6.2 0.0 L4 Spokestack VS clean-status-bar

Tidy up your Android status bar before taking screenshots for the Play Store
android-arsenal.com

6.2 0.0 Spokestack VS android-arsenal.com

Source to android-arsenal.herokuapp.com
Endoscope

6.2 0.0 L5 Spokestack VS Endoscope

Endoscope lets you to stream live video between android devices over Wi-Fi! 📱📲
sgtpuzzles

6.0 9.3 L1 Spokestack VS sgtpuzzles

Android port of Simon Tatham's Puzzles
MaterialUp

5.7 0.0 L4 Spokestack VS MaterialUp

MaterialUp Android App
Leisure

5.7 0.0 L4 Spokestack VS Leisure

Leisure is an Android App containing Zhihu Daily,Guokr Scientific,XinhuaNet News and Douban Books
WaniKani-for-Android

5.5 0.0 L3 Spokestack VS WaniKani-for-Android

DISCONTINUED. An Android client application for the awesome kanji learning website wanikani.com
AppIconNameChanger

5.4 0.0 Spokestack VS AppIconNameChanger

Library to change Android launcher App Icon and App Name programmatically !
LeeCo

5.3 0.0 L1 Spokestack VS LeeCo

LeeCo is an awesome app for (including unlock) problems, solutions, discuss(from leetcode) and comments.
OpenFlappyBird

5.1 0.0 L4 Spokestack VS OpenFlappyBird

An open source clone of a famous flappy bird game for Android using AndEngine
OpenLibra-Material

4.7 0.0 L5 Spokestack VS OpenLibra-Material

OpenLibra client on Material Design
OpenImgur

4.7 0.0 L3 Spokestack VS OpenImgur

Open source Imgur Android App
FoldingNavigationDrawer-Android

4.5 0.0 L3 Spokestack VS FoldingNavigationDrawer-Android

This is a sample project present how to use Folding-Android to add Folding Efect to Navigation Drawer.
Flick Launcher

4.5 0.0 L1 Spokestack VS Flick Launcher

Pixel Launcher for everyone!
TurtlePlayer

4.5 0.0 L2 Spokestack VS TurtlePlayer

A Free, Fully Fledged, Open-Source Music Player for Android
GradientDrawableTuner

4.3 0.0 Spokestack VS GradientDrawableTuner

🕹️ See how the properties of Android's "shape" affect the Drawable's appearance, intuitively.
MaterialDesignColorPalette

4.3 0.0 L3 Spokestack VS MaterialDesignColorPalette

This is a dev tool to visualize the colours of Material design defined on
HackerNews

4.1 0.0 L5 Spokestack VS HackerNews

An open source Hacker News client for Android.
freegemas-gdx

4.0 0.0 L1 Spokestack VS freegemas-gdx

Freegemas libGDX is an Android and Java desktop port of Freegemas, which in turn is an open source version of the well known Bejeweled.
vanilla

4.0 0.0 L1 Spokestack VS vanilla

Vanilla Music Player for Android (abandoned). Visit https://github.com/vanilla-music/vanilla for an actively developed fork
PopularMovies

3.9 0.0 Spokestack VS PopularMovies

:movie_camera: Movie discovery app showcasing Android best practices with Google's recommended architecture: MVVM + Repository + Offline support + Android Architecture Components + Paging library & Retrofit2.

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.

Do you think we are missing an alternative of Spokestack or a related project?

Add another 'App' Package

Popular Comparisons

README

And Android isn't the only platform it supports!

Creating a free account at spokestack.io lets you train your own NLU models and test out TTS without adding code to your app. We can even train a custom wakeword and TTS voice for you, ensuring that your app's voice is unique and memorable.

For a brief introduction, read on, but for more detailed guides, see the following:

Installation

Note: Spokestack used to be hosted on JCenter, but since the announcement of its discontinuation, we've moved distribution to Maven Central. Please ensure that your root-level build.gradle file includes mavenCentral() in its repositories block in order to access versions >= 11.0.2.

A Note on API Level

The minimum Android SDK version listed in Spokestack's manifest is 8 because that's all you should need to run wake word detection and speech recognition. To use other features, it's best to target at least API level 21.

If you include ExoPlayer for TTS playback (see below), you might have trouble running on versions of Android older than API level 24. If you run into this problem, try adding the following line to your gradle.properties file:

android.enableDexingArtifactTransform=false

Dependencies

Add the following to your app's build.gradle:

android {

  // ...

  compileOptions {
    sourceCompatibility JavaVersion.VERSION_1_8
    targetCompatibility JavaVersion.VERSION_1_8
  }
}

// ...

dependencies {
  // ...

  // make sure to check the badge above or "releases" on the right for the
  // latest version!
  implementation 'io.spokestack:spokestack-android:11.5.2'

  // for TensorFlow Lite-powered wakeword detection and/or NLU, add this one too
  implementation 'org.tensorflow:tensorflow-lite:2.6.0'

  // for automatic playback of TTS audio
  implementation 'androidx.media:media:1.3.0'
  implementation 'com.google.android.exoplayer:exoplayer-core:2.14.0'

  // if you plan to use Google ASR, include these
  implementation 'com.google.cloud:google-cloud-speech:1.22.2'
  implementation 'io.grpc:grpc-okhttp:1.28.0'

  // if you plan to use Azure Speech Service, include these, and
  // note that you'll also need to add the following to your top-level
  // build.gradle's `repositories` block:
  // maven { url 'https://csspeechstorage.blob.core.windows.net/maven/' }
  implementation 'com.microsoft.cognitiveservices.speech:client-sdk:1.9.0'

}

Usage

See the quickstart guide for more information, but here's the 30-second version of setup:

You'll need to request the RECORD_AUDIO permission at runtime. See our skeleton project for an example of this. The INTERNET permission is also required but is included by the library's manifest by default.
Add the following code somewhere, probably in an Activity if you're just starting out:

private lateinit var spokestack: Spokestack

// ...
spokestack = Spokestack.Builder()
    .setProperty("wake-detect-path", "$cacheDir/detect.tflite")
    .setProperty("wake-encode-path", "$cacheDir/encode.tflite")
    .setProperty("wake-filter-path", "$cacheDir/filter.tflite")
    .setProperty("nlu-model-path", "$cacheDir/nlu.tflite")
    .setProperty("nlu-metadata-path", "$cacheDir/metadata.json")
    .setProperty("wordpiece-vocab-path", "$cacheDir/vocab.txt")
    .setProperty("spokestack-id", "your-client-id")
    .setProperty("spokestack-secret", "your-secret-key")
    // `applicationContext` is available inside all `Activity`s
    .withAndroidContext(applicationContext)
    // see below; `listener` here inherits from `SpokestackAdapter`
    .addListener(listener)
    .build()

// ...

// starting the pipeline makes Spokestack listen for the wakeword
spokestack.start()

This example assumes you're storing wakeword and NLU models in your app's cache directory; again, see the skeleton project for an example of decompressing these files from the assets bundle into this directory.

To use the demo "Spokestack" wakeword, download the TensorFlow Lite models: detect | encode | filter

If you don't want to bother with that yet, just disable wakeword detection and NLU, and you can leave out all the file paths above:

spokestack = Spokestack.Builder()
    .withoutWakeword()
    .withoutNlu()
    // ...
    .build()

In this case, you'll still need to start() Spokestack as above, but you'll also want to create a button somewhere that calls spokestack.activate() when pressed; this starts ASR, which transcribes user speech.

Alternately, you can set Spokestack to start ASR any time it detects speech by using a non-default speech pipeline profile as described in the speech pipeline documentation. In this case you'd want the VADTriggerAndroidASR profile:

// replace
.withoutWakeword()
// with
.withPipelineProfile("io.spokestack.spokestack.profile.VADTriggerAndroidASR")

Note also the addListener() line during setup. Speech processing happens continuously on a background thread, so your app needs a way to find out when the user has spoken to it. Important events are delivered via events to a subclass of SpokestackAdapter. Your subclass can override as many of the following event methods as you like. Choosing to not implement one won't break anything; you just won't receive those events.

speechEvent(SpeechContext.Event, SpeechContext): This communicates events from the speech pipeline, including everything from notifications that ASR has been activated/deactivated to partial and complete transcripts of user speech.
nluResult(NLUResult): When the NLU is enabled, user speech is automatically sent through NLU for classification. You'll want the results of that classification to help your app decide what to do next.
ttsEvent(TTSEvent): If you're managing TTS playback yourself, you'll want to know when speech you've synthesized is ready to play (the AUDIO_AVAILABLE event); even if you're not, the PLAYBACK_COMPLETE event may be helpful if you want to automatically reactivate the microphone after your app reads a response.
trace(SpokestackModule, String): This combines log/trace messages from every Spokestack module. Some modules include trace events in their own event methods, but each of those events is also sent here.
error(SpokestackModule, Throwable): This combines errors from every Spokestack module. Some modules include error events in their own event methods, but each of those events is also sent here.

The quickstart guide contains sample implementations of most of these methods.

As we mentioned, classification is handled automatically if NLU is enabled, so the main methods you need to know about while Spokestack is running are:

start()/stop(): Starts/stops the pipeline. While running, Spokestack uses the microphone to listen for your app's wakeword unless wakeword is disabled, in which case ASR must be activated another way. The pipeline should be stopped when Spokestack is no longer needed (or when the app is suspended) to free resources.
activate()/deactivate(): Activates/deactivates ASR, which listens to and transcribes what the user says.
synthesize(SynthesisRequest): Sends text to Spokestack's cloud TTS service to be synthesized as audio. Under the default configuration, this audio will be played automatically when available.

Development

Maven is used for building/deployment, and the package is hosted at Maven Central.

This package requires the Android NDK to be installed and the ANDROID_HOME and ANDROID_NDK_HOME variables to be set. On OSX, ANDROID_HOME is usually set to ~/Library/Android/sdk and ANDROID_NDK_HOME is usually set to ~/Library/Android/sdk/ndk/<version>.

ANDROID_NDK_HOME can also be specified in your local Maven settings.xml file as the android.ndk.path property.

Testing/Coverage

mvn test jacoco:report

Lint

mvn checkstyle:check

Release

Ensure that your Sonatype/Maven Central credentials are in your user settings.xml (usually ~/.m2/settings.xml):

<servers>
    <server>
        <id>ossrh</id>
        <username>sonatype-username</username>
        <password>sonatype-password</password>
    </server>
</servers>

On a non-master branch, run the following command. This will prompt you to enter a version number and tag for the new version, push the tag to GitHub, and deploy the package to the Sonatype repository.

mvn release:clean release:prepare release:perform

The Maven goal may fail due to a bug where it tries to upload the files twice, but the release has still happened.

Complete the process by creating and merging a pull request for the new branch on GitHub and updating the release notes by editing the tag.

For additional information about releasing see http://maven.apache.org/maven-release/maven-release-plugin/

License

Licensed under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at

  http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License.

*Note that all licence references and agreements mentioned in the Spokestack README section above are relevant to that project's source code only.

Spokestack

Extensible Android mobile voice framework: wakeword, ASR, NLU, and TTS. Easily add voice to any Android app!