- 22 Oct, 2020 1 commit
-
-
Manoj Plakal authored
-
- 18 Aug, 2020 1 commit
-
-
Manoj Plakal authored
* Added TF-Lite-compatible feature extractor and model exporter for YAMNet. - Added a TF-Lite compatible feature extractor. With the latest TF-Lite, that involves a DFT-multiplication replacement for tf.abs(tf.signal.stft()) and not a lot else. Note that TF-Lite now allows variable-length inputs. - Added a YAMNet exporter that produces TF2 SavedModels, TF-Lite models, and TF-JS models. - Cleanups: switched hyperparameters to a dataclass, got rid of some lingering cruft in yamnet_test. * Responded to DAn's comments in https://github.com/tensorflow/models/pull/9098 - Switched some hparams to float - Made class map asset available on the exported model, and tested that it can be loaded from the various exports.
-
- 12 Aug, 2020 1 commit
-
-
Manoj Plakal authored
* Input/Output tweaks for YAMNet and VGGish. - Waveform input for YAMNet is now padded so that we get at least one patch of log mel spectrogram. The VGGish TF-Hub exporter uses YAMNet's feature computation so the VGGish export will also pad waveform input similarly. - Added a 1024-D embedding output to YAMNet so we now produce predicted scores, log mel spectrogram features, and embeddings, to satisfy a variety of uses: class prediction, acoustic feature visualization, semantic feature extraction. - Simplified usage of YAMNet in inference mode. Instead of trying to work around implicit batch size issues in the Model.predict() API, we simply __call__() the Model. - Switched inference.py to TF 2 and Eager execution. - Updated the visualization notebook: now uses TF2/Eager and can be loaded and run in Google Colab. * Responded to DAn's comments in https://github.com/tensorflow/models/pull/9092 - Merged spectrogram computation and framing into a single function that returns both spectrogram and framed features. - Extended waveform padding to pad up to an integral number of hops in addition to the final STFT analysis window.
-
- 09 Aug, 2020 1 commit
-
-
Manoj Plakal authored
* Made VGGish and YAMNet work in TF2 without disabling TF2 behavior. Allowed TF2 behavior and allowed passing in a features tensor into the VGGish model definition. Both of these changes are needed for making TF-Hub exports of these models. Lifted constraints on TF versions since tf_slim has been updated to work with TF 2. * Responded to DAn's comments in https://github.com/tensorflow/models/pull/9077 * Fixed typo in comment.
-
- 17 Apr, 2020 1 commit
-
-
Dan Ellis authored
-
- 26 Feb, 2020 1 commit
-
-
Dan Ellis authored
-
- 22 Feb, 2020 1 commit
-
-
Dan Ellis authored
* Modify yamnet_visualization to work for any (?) sampling rate in input wav file.
-
- 20 Feb, 2020 1 commit
-
-
Dan Ellis authored
-
- 18 Jan, 2020 1 commit
-
-
Manoj Plakal authored
- Made code work with either TF v1.x or TF v2.x, while explicitly enabling v1.x behavior.l - Pulled slim from tf_slim package instead of through tensorflow contrib. Note that tf_slim itself uses tensorflow contrib so it requires using TF v1.x for now (referenced a relevant PR which should remove this limitation once it gets merged). - Removed all mention of scipy. Switched wav writing to soundfile. - Switched package name to soundfile instead of pysoundfile. The former is the newer name. - Updated installation instructions for both vggish and yamnet to reflect these changes. - Tested new installation procedures. vggish works with TF v1.15, yamnet works with TF v1.15.0 as well as TF v2.1.0.
-
- 14 Jan, 2020 1 commit
-
-
Dan Ellis authored
-
- 21 Nov, 2019 1 commit
-
-
Dan Ellis authored
Add files for YamNet, a sound event classifier.
-