- 08 Sep, 2021 1 commit
-
-
Dan Ellis authored
-
- 08 Jun, 2021 1 commit
-
-
Dan Ellis authored
The original Colab by malcolmslaney didn't work with the current VGGish/tensorflow. Changed the link to an updated version.
-
- 22 Oct, 2020 1 commit
-
-
Manoj Plakal authored
-
- 18 Aug, 2020 1 commit
-
-
Manoj Plakal authored
* Added TF-Lite-compatible feature extractor and model exporter for YAMNet. - Added a TF-Lite compatible feature extractor. With the latest TF-Lite, that involves a DFT-multiplication replacement for tf.abs(tf.signal.stft()) and not a lot else. Note that TF-Lite now allows variable-length inputs. - Added a YAMNet exporter that produces TF2 SavedModels, TF-Lite models, and TF-JS models. - Cleanups: switched hyperparameters to a dataclass, got rid of some lingering cruft in yamnet_test. * Responded to DAn's comments in https://github.com/tensorflow/models/pull/9098 - Switched some hparams to float - Made class map asset available on the exported model, and tested that it can be loaded from the various exports.
-
- 12 Aug, 2020 1 commit
-
-
Manoj Plakal authored
* Input/Output tweaks for YAMNet and VGGish. - Waveform input for YAMNet is now padded so that we get at least one patch of log mel spectrogram. The VGGish TF-Hub exporter uses YAMNet's feature computation so the VGGish export will also pad waveform input similarly. - Added a 1024-D embedding output to YAMNet so we now produce predicted scores, log mel spectrogram features, and embeddings, to satisfy a variety of uses: class prediction, acoustic feature visualization, semantic feature extraction. - Simplified usage of YAMNet in inference mode. Instead of trying to work around implicit batch size issues in the Model.predict() API, we simply __call__() the Model. - Switched inference.py to TF 2 and Eager execution. - Updated the visualization notebook: now uses TF2/Eager and can be loaded and run in Google Colab. * Responded to DAn's comments in https://github.com/tensorflow/models/pull/9092 - Merged spectrogram computation and framing into a single function that returns both spectrogram and framed features. - Extended waveform padding to pad up to an integral number of hops in addition to the final STFT analysis window.
-
- 10 Aug, 2020 2 commits
-
-
Manoj Plakal authored
* Added a TF-Hub-compatible SavedModel exporter for VGGish. * Responded to DAn's comments, switched linspace to arange everywhere.
-
Manoj Plakal authored
Fixed a long-standing bug where the released VGGish model used post-activation embedding output while the released embeddings were pre-activation. There are still discrepancies due to other reasons: differences in choice of YouTube transcode, repeated resamplings with different resamplers, slight differences in feature computation, etc.
-
- 09 Aug, 2020 1 commit
-
-
Manoj Plakal authored
* Made VGGish and YAMNet work in TF2 without disabling TF2 behavior. Allowed TF2 behavior and allowed passing in a features tensor into the VGGish model definition. Both of these changes are needed for making TF-Hub exports of these models. Lifted constraints on TF versions since tf_slim has been updated to work with TF 2. * Responded to DAn's comments in https://github.com/tensorflow/models/pull/9077 * Fixed typo in comment.
-
- 17 Apr, 2020 1 commit
-
-
Dan Ellis authored
-
- 13 Apr, 2020 1 commit
-
-
Jaeyoun Kim authored
* Update README.md No Maintenance Intended * Update README.md * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md No Maintenance Intended * Create README.md No Maintenance Intended * Update README.md No Maintenance Intended * Update README.md...
-
- 26 Feb, 2020 1 commit
-
-
Dan Ellis authored
-
- 22 Feb, 2020 1 commit
-
-
Dan Ellis authored
* Modify yamnet_visualization to work for any (?) sampling rate in input wav file.
-
- 20 Feb, 2020 1 commit
-
-
Dan Ellis authored
-
- 18 Jan, 2020 1 commit
-
-
Manoj Plakal authored
- Made code work with either TF v1.x or TF v2.x, while explicitly enabling v1.x behavior.l - Pulled slim from tf_slim package instead of through tensorflow contrib. Note that tf_slim itself uses tensorflow contrib so it requires using TF v1.x for now (referenced a relevant PR which should remove this limitation once it gets merged). - Removed all mention of scipy. Switched wav writing to soundfile. - Switched package name to soundfile instead of pysoundfile. The former is the newer name. - Updated installation instructions for both vggish and yamnet to reflect these changes. - Tested new installation procedures. vggish works with TF v1.15, yamnet works with TF v1.15.0 as well as TF v2.1.0.
-
- 14 Jan, 2020 1 commit
-
-
Dan Ellis authored
-
- 13 Jan, 2020 3 commits
- 21 Nov, 2019 1 commit
-
-
Dan Ellis authored
Add files for YamNet, a sound event classifier.
-
- 13 Jun, 2019 1 commit
-
-
Manoj Plakal authored
* Moved VGGish code into its own directory. * Moved most of old README.md into vggish/README.md.
-
- 12 Nov, 2018 1 commit
-
-
Samuel Neugber authored
-
- 08 Nov, 2018 1 commit
-
-
Samuel Neugber authored
-
- 05 Nov, 2018 1 commit
-
-
David T.H. Kao authored
-
- 04 Nov, 2018 1 commit
-
-
David Kao authored
-
- 20 Sep, 2018 1 commit
-
-
Souradip Mookerjee authored
-
- 15 Aug, 2018 1 commit
-
-
Malcolm Slaney authored
Added a pointer to a Colab that illustrates how to use the public AudioSet models to generate embeddings for user-specified sounds.
-
- 13 Feb, 2018 3 commits
-
-
Manoj Plakal authored
To simplify syncing this c ode back into Google.
-
Manoj Plakal authored
-
Manoj Plakal authored
Clamp the requested frequency range to [0, Nyquist].
-
- 09 Feb, 2018 1 commit
-
-
Manoj Plakal authored
TF Saver now requires specifying the checkpoint version even when restoring to avoid errors when we specify a path to a checkpoint file instead of a directory.
-
- 15 Oct, 2017 1 commit
-
-
Quentin Pleplé authored
-
- 12 Oct, 2017 1 commit
-
-
Manoj Plakal authored
Also added 'sudo' to installation instructions.
-
- 23 Sep, 2017 1 commit
-
-
Neal Wu authored
-
- 21 Sep, 2017 1 commit
-
-
Neal Wu authored
-