1.0.0-beta7

Version 1.0.0-beta7

Read the announcement at https://blog.konduit.ai/2020/05/14/deeplearning4j-1-0-0-beta7-released/ for the highlights of this release.

Deeplearning4j

Features and Enhancements

  • Added Keras model import support for tf.keras models Link, Link
    • Full inference and training support is available for ops/layers in the tf.keras namespace; inference only for general Tensorflow operations outside of the tf.keras namespace
    • Note also improvements to Keras import for reshape, permute, etc operations due to NHWC and NWC support in DL4J
  • DL4J now supports NHWC (channels last) data format for all CNN 2D layers, in addition to NCHW Link
  • DL4J now supports NWC (channels last - [minibatch, sequence_length, size]) for all RNN and CNN 1D layers, in addition to NCW Link
  • Added Deconvolution3D layer Link
  • Keras import: added ReLU, ELU and Softmax advanced activation layers Link and Swish activation function Link
  • Added DL4J SameDiffLoss class (for easily-defined DL4J ILossFunction's via SameDiff) Link
  • Useful exceptions are now thrown when attempting to perform unsupported operations on FastText Link
  • Added MultiLayerNetwork.evaluate(MultiDataSetIterator) and .evaluateRegression(MultiDataSetIterator) methods Link, Link

Bug Fixes and Optimizations

  • Updaters (Adam, AdaGrad, etc) optimized via C++ operations (significant training performance boost) for DL4J and SameDiff Link, Link
  • Some packages relocated to avoid split packages (that can be a problem for OSGi and Java 9 modules) Link
    • Note: this is a breaking change for some class packages/imports. See this link for details on exact package changes
  • Deeplearning4j UI: Webjars versions locked down using dependency management to avoid check on each build Link
  • Added MKLDNN (DNNL/OneDNN) support for depthwise_conv2d operation for DL4J and SameDiff Link
  • Refactored/merged modules dl4j-perf and dl4j-util into deeplearning4j-core Link
  • Fixed an issue with BertWordPieceTokenizer - potential StackOverflowError with certain inputs Link
  • Fixed an issue with GlobalPooling layer with masks of different datatype to the activations datatype Link
  • Fixed an issue with DL4JModelValidator for ComputationGraph Link
  • Fixed an issue where SameDiff layers in DL4J could throw an exception when used with transfer learning Link
  • Weight initialization for EmbeddingLayer and EmbeddingSequenceLayer now no longer depend on the vocabulary size (only the vector size) Link
  • Fixed an issue with Keras import with bidirectional layers + preprocessors Link
  • DL4J UI: added redirect from /train to /train/overview Link
  • Fixed an issue where RecordReaderDataSetIterator builder collectMetaData configuration was not being applied Link
  • Fixed an issue where MultiLayerNetwork evaluation was not passing metadata to the IEvaluation instances during evaluation Link, Link
  • Fixed an issue with Spark training SharedTrainingMaster when training with a ComputationGraph and MultiDataSets Link
  • Assorted fixes for edge cases for DL4J Keras import Link
  • deelpearning4j-nlp-korean will no longer be released for Scala 2.12 due to required dependency only having Scala 2.11 version avairable Link
  • Fix for ConvolutionalIterationListener for ComputationGraph Link
  • Fixed an issue where dataset and model zoo downloads could get stuck if the server fails to send any data (now: timeout + retry) Link
  • DL4J ModelSerializer no longer writes temporary files when restoring models from InputStream Link
  • Fixes issues with UIServer multi session mode, and potential shutdown race condition Link
  • Fixed an issue where TfidfVectorizer.vectorize() could throw a NPE when fit from LabelAwareIterator Link

ND4J/SameDiff:

Features and Enhancements

  • SameDiff multi-threaded inference enhanced (and fixed) - a single SameDiff instance can now be used for inference safely and efficiently from multiple threads Link Link
  • cuDNN support added to SameDiff (automatically enabled for nd4j-cuda-10.x backend) Link
  • Added ND4J namespaces: Nd4j.cnn, Nd4j.rnn, Nd4j.image Link
  • Added new Image operations namespace operations:
    • rgbToHsv, hsvToRgb Link
    • rgbToYiq, yiqToRgb, rgbToYuv, yuvToRgb Link
    • imageResize Link
  • Added new Random operations namespace operations:
    • gamma, poisson, shuffle Link
  • Added new Math namespace operations:
    • clipByAvgNorm, embeddingLookup Link
    • mergeMaxIndex Link
  • Added new NN namespace operations:
  • Added new CNN namespace operations:
    • upsampling3d Link
  • Added new linalg operations namespace
    • triangular_solve Link
    • tri operation Link
    • triu operation Link
  • Added new RNN operation namespace operations:
    • lstmLayer (note old lstmLayer method renamed to lstmBlock) Link
    • gru Link
  • Added new Loss operations namespace - Nd4j.loss Link
  • Mapped operations for Tensorflow import:
    • HSVToRGB, RGBToHSV, Igamma, Igammac, RandomGamma, RandomPoisson, RandomPoissonV2, RandomShuffle Link
  • Added SameDiff ProfilingListener - writes op performance profiles in Chrome profiler format (load in chrome://tracing/) Link Link
  • Added SameDiff ProfileAnalyzer tool to compare profiles output from ProfilingListener (or Tensorflow) Link Link
  • SameDiff listener API: added frame and iteration information for listener methods Link Link
  • Added (non-backend-specific) method of accessing Nd4j environment: Nd4j.getEnvironment() method (environment info and low-level configuration options) Link Link
  • Improved memory limits/configuration support for libnd4j (c++) Link
  • Added pairwise (broadcastable) power backprop operation Link
  • Updated JavaCPP presets MKL version to 2020.0 from 2019.5 Link
  • Added DynamicCustomOp dargs - datatype arguments Link Link
    • Output datatype configuration for Range op Link, SequenceOp Link, ConfusionMatrix Link
  • Added tensormmul_bp op Link
  • OpenBLAS version upgraded to 0.3.8 Link
  • libnd4j (c++ codebase underlying DL4J, ND4J and SameDiff) refactored to be more easily embeddable in other C++ projects Link
  • ImagePreProcessingScaler now supports preprocessing of labels (for segmentation) Link
  • Additional datatypes now supported for nd4j-tensorflow TensorflowConversion Link
  • SameDiff operation namespaces (sd.math, sd.image, etc) are now code generated to ensure SameDiff and ND4J namespaces are identical (all operations included, same API) Link
  • Added ND4J ArchiveUtils.unzipFileTo(String, String, boolean logFiles) overload to enable/disable extracted file path logging Link
  • Added weight format configuration for following operations: conv1D, conv2D, conv3D, deconv2d, deconv3d, depthwiseConv2d, pointwiseConv2d, sconv2d Link
  • Added backprop operation implementations for mergemax, mergeadd, mergeavg operations Link
  • MKL version upgraded to 2020.0 2020.1; OpenCV upgraded from 4.2.0 to 4.3.0 Link
  • SameDiff: DifferentialFunctionFactory class removed in favor of namespace methods (sd.math, sd.linalg, etc) Link
  • Added lstmLayer_bp operation Link
  • Added gru_bp operation Link
  • linspace operation can now use both targs and arrays for start/end/size arguments Link
  • Assorted dependency updates - OpenBLAS (0.3.9), OpenCV (4.3.0), Leptonica (1.79.0) Link
  • Upgraded assorted dependency versions: javax.activation:activation (1.1 -> 1.1.1), stream analytics (2.7.0->2.9.8), Apache Spark (2.4.3->2.4.5), Jackson databind (2.10.1 -> 2.10.3), Vertx (3.8.3 -> 3.9.0) Link
  • Added nd4j-common-tests ResourceUtils.listClassPathfiles method Link

Bug Fixes and Optimizations

  • Updaters (Adam, AdaGrad, etc) optimized via C++ operations (significant training performance boost) for DL4J and SameDiff Link, Link
  • SameDiff - added CuDNN support Link
  • Some packages relocated to avoid split packages (that can be a problem for OSGi and Java 9 modules) Link
    • Note: this is a breaking change for some class packages/imports. See this link for details on exact package changes
  • Fixed some issues with Tensorflow import of FusedBatchNorm operation Link
  • Fixed an issue where the Roll operation did not match Tensorflow operation Link Link
  • Fixed an issue where ArchiveUtils could fail to create the top level destination directory when it does not exist Link
  • Fixed an issue where resize_bicubic operation did not match Tensorflow for some configuration values Link Link
  • Pad operation now supports long/int64 values for padding array Link Link
  • Fixed an issue where hashcode operation shape function wasn't always returning int64/long dtype Link
  • Fixed an issue with reshape operation on empty arrays with -1s Link Link
  • Improved performance on CUDA for concat operation Link and CPU/GPU Link
  • Improved performance for bias_add operation
    • On CPU for NHWC case Link
    • Generally Link
    • On CUDA for 2D case Link
  • Added MKLDNN (DNNL/OneDNN) support for depthwise_conv2d operation for DL4J and SameDiff Link
  • Fixed a small SameDiff execution issue for switch operation where the predicate is a constant Link
  • Fixed an issue with batchnorm operation when input arrays have unusual strides Link
  • Merged nd4j-buffer, nd4j-content modules into nd4j-api Link
  • Deleted deprecated nd4j-jackson module (remaining functionality available in nd4j-api) Link
  • Deleted unused/unmaintained nd4j-camel and nd4j-gson modules Link
  • Optimization for legacy random ops Link
  • Optimization for broadcast operations Link, Link, Link, Link, Link
  • Performance optimization for multiple operations: softmax, squeeze, expand_dims, tanh Link
  • Optimization for transpose/permute operations Link
  • Performance enhancement: MKLDNN matmul used for some mmul operation cases Link
  • Optimization for gather operation on CPU Link
  • Optimization for stack/unstack operations on CPU Link
  • Optimization for split operation (CPU and CUDA) Link Link
  • ND4J initialization no longer logs number of OpenMP BLAS threads for CUDA Link
  • Optimization: Fixed issues with auto-vectorization on multple CPU operations Link
  • Optimization for reshape operation Link, Link
  • Fixed an issue where INDArray.hashCode() could cause an exception on some datatypes Link
  • Optimization for CPU: MKLDNN is now used for softmax, tanh, softmax_bp and tanh_bp operations Link, Link, Link, Link
  • Fixed random_exponential operation Link
  • Improved performance on C++ SameDiff graph execution via reduced array zeroing where safe to do so Link
  • Improved C++ indexing implementation impacting CPU performance on some operations Link
  • Fixed an issue where Split operation could have incorrect output shapes for empty arrays Link
  • Fixed some issues with SameDiff.equals method Link
  • Fixed an issue with reshape operation output shape on empty arrays Link, Link
  • Nd4j.gemm now uses Mmul operation internally to avoid potential threading issues with direct BLAS calls on CUDA Link
  • Fixed an edge case issue with percentile operation link
  • Fixed an edge case issue for cusolved (CUDA) in libnd4j Link
  • Fixed an issue with error formatting for segment operations for incorrect lengths Link
  • Fixed an issue where ND4J workspaces were not guaranteed to be unique Link
  • Fixed some operation implementations when operating on views (Batch/Space to Space/Batch/Depth; batchnorm_bp) Link
  • Fixed an issue where exponential distribution random number generation operation could produce infinities extremely rarely (~1 in 10^9 values) Link
  • Fixed an issue with long file paths for memory mapped workspaces on Windows Link
  • Memory for memory mapped workspaces are now deallocated immediately when workspace is destroyed, instead of waiting for GC to free memory Link
  • Fall-back to other BLAS implementation for cases where MKLDNN GEMM implementation is slow Link
  • Set nd4j-native source/target to Java 7 Link, Link

DataVec

Features and Enhancements

  • datavec-python: added zero-copy support for bytes/byte buffers Link
  • datavec-python: Python exceptions are now thrown as Java exceptions Link
  • datavec-python: Added support for additional NumPy datatypes Link
  • datavec-python: Python version upgraded from 3.7.6 to 3.7.7 Link

Bug Fixes and Optimizations

  • Deleted not properly maintained modules: datavec-camel, datavec-perf Link
  • Fixed missing BOOL datatype support for arrow conversion functionality Link
  • Assorted fixes for datavec-python Link Link, Link
  • Fixed an issue with LineRecordReader where initialization was performed unnecessarily (adding performance overhead) Link

RL4J

Features and Enhancements

  • Refactoring to decouple configuration and learning methods from their implementations Link
  • Added builder patterns for all configuration classes Link

Arbiter

Bug Fixes and Optimizations

  • Fixes an issue with GridSearchCandidateGenerator not working correctly for some cases Link, Link
Last modified 3mo ago