1.0.0-beta3

Highlights - 1.0.0-beta3 Release

  • ND4J/Deeplearning4j: Added support for CUDA 10.0. Dropped support for CUDA 8.0. (1.0.0-beta3 release has CUDA 9.0, 9.2 and 10.0 support)

  • SameDiff now supports training and evaluation from DataSetIterator and MultiDataSetIterator. Evaluation classes have been moved to ND4J.

  • DL4J Spark training (gradient sharing) is now fully fault tolerant, and has improvements for threshold adaptation (potentially more robust convergence). Ports can now be easily configured independently on the master and workers.

Deeplearning4J

Deeplearning4J: New Features

  • Added OutputAdapter interface and MultiLayerNetwork/ComputationGraph.output method overloads using OutputAdapter (avoids allocating off-heap memory that needs to be cleaned up by GC)

  • Added ComputationGraph/MultiLayerNetwork rnnTimeStep overload with user-specified workspace.

  • Added Cnn3DLossLayer

  • ParallelInference: Instances can now update the model in real-time (without re-init)

  • ParallelInference: Added ParallelInference INPLACE mode

  • Added validation for incompatible loss/activation function combinations (such as softmax+nOut=1, or sigmoid+mcxent). New validation can be disabled using outputValidation(false)

  • Spark training: Added full fault tolerance (robust failure recovery) for gradient sharing implementation

  • Spark training now supports configuring ports more flexibly (and differently for different workers) using PortSupplier

  • Spark training: overhauled gradient sharing threshold adaptation algorithms; made it possible to customize threshold settings, plus made defaults more robust to initial threshold configuration, improving convergence speed in some cases.

  • Spark training: implemented chunked messaging to reduce memory requirements (and insufficient buffer length issues) for large messages

  • Spark training: Added MeshBuildMode configuration for improved scalability for large clusters

  • Spark network data pipelines: added FileBatch, FileBatchRecordReader etc for "small files" (images etc) distributed training use cases

  • Added FailureTestingListener for fault tolerance/debugging purposes

  • Upgraded Apache Lucene/Solr to version 7.5.0 (from 7.4.0)

  • Added system properties (org.deeplearning4j.tempdir and org.nd4j.tempdir) to allow overriding of the temporary directories ND4J and DL4J use (see the sketch after this list)

  • Made MultiLayerNetwork/ComputationGraph.clearLayerStates methods public (previously protected)

  • AbstractLayer.layerConf() method is now public

  • ParallelWrapper module no longer has a Scala version suffix for its artifact ID; the new artifact ID is deeplearning4j-parallel-wrapper

  • Improved validation and error messages for invalid inputs/labels in Yolo2OutputLayer

  • Spark training: added SharedTrainingMaster.Builder.workerTogglePeriodicGC and .workerPeriodicGCFrequency to easily configure the ND4J garbage collection configuration on workers; the default GC frequency on workers is now 5 seconds (see the sketch after this list)

  • Spark training: added threshold encoding debug mode (logs current threshold and encoding statistics on each worker during training). Enable using SharedTrainingConfiguration.builder.encodingDebugMode(true). Note this operation has computational overhead.
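
The temporary-directory override mentioned in the list above (org.deeplearning4j.tempdir and org.nd4j.tempdir) can be passed as JVM arguments or set programmatically. A minimal sketch; the paths are placeholders, and it is assumed the properties must be set before any ND4J/DL4J classes are initialized for the override to take effect:

```java
public class TempDirConfig {
    public static void main(String[] args) {
        // Equivalent to launching with -Dorg.nd4j.tempdir=... -Dorg.deeplearning4j.tempdir=...
        // Set these before any ND4J/DL4J classes are loaded.
        System.setProperty("org.nd4j.tempdir", "/data/tmp/nd4j");            // placeholder path
        System.setProperty("org.deeplearning4j.tempdir", "/data/tmp/dl4j");  // placeholder path

        // ... create and train networks as usual from here ...
    }
}
```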
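
The worker garbage-collection settings from the Spark training item above are configured on the SharedTrainingMaster builder. A hedged sketch follows; the builder constructor arguments and the minimal VoidConfiguration are illustrative assumptions, and only the workerTogglePeriodicGC/workerPeriodicGCFrequency methods are taken from the release note:

```java
import org.deeplearning4j.spark.parameterserver.training.SharedTrainingMaster;
import org.nd4j.parameterserver.distributed.conf.VoidConfiguration;

public class WorkerGcConfig {
    public static void main(String[] args) {
        VoidConfiguration voidConfig = VoidConfiguration.builder().build();   // assumed minimal configuration

        SharedTrainingMaster tm = new SharedTrainingMaster.Builder(voidConfig, 32)   // constructor arguments assumed
                .workerTogglePeriodicGC(true)        // enable periodic System.gc() calls on Spark workers
                .workerPeriodicGCFrequency(5_000)    // every 5000 ms - the new default frequency noted above
                .build();

        // tm would then be passed to SparkDl4jMultiLayer / SparkComputationGraph as usual.
    }
}
```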

Deeplearning4J: Bug Fixes and Optimizations

  • Fixed an issue where L1/L2 and updaters (Adam, Nesterov, etc) were applied before dividing gradients by the minibatch size to obtain the average gradient. To maintain the old behaviour, use NeuralNetConfiguration.Builder.legacyBatchScaledL2(true) (see the sketch at the end of this list).

    • Note that learning rates may need to be decreased for some updaters (such as Adam) to account for this change vs. earlier versions. Some other updaters (such as SGD, NoOp, etc) should be unaffected.

    • Note that deserialized (loaded) configurations/networks saved in 1.0.0-beta2 or earlier will default to old behaviour for backward compatibility. All new networks (created in 1.0.0-beta3) will default to the new behaviour.

  • Fixed an issue where EarlyStoppingScoreCalculator would not correctly handle "maximize score" cases (where the score should be maximized rather than minimized)

  • Fixed order (BGR vs. RGB) for VGG16ImagePreProcessor channel offset values

  • Fixed bug with variational autoencoders using weight noise

  • Fixed issue with BaseDataSetIterator not respecting the 'maximum examples' configuration

  • Optimization: A workspace is now used for ComputationGraph/MultiLayerNetwork evaluation methods (avoids allocating off-heap memory during evaluation that must be cleaned up by garbage collector)

  • Fixed an issue where shuffling combined with a subset for MnistDataSetIterator would not maintain the same subset between resets

  • Fixed issue with StackVertex.getOutputType

  • Fixed issue with CNN to/from RNN preprocessors' handling of mask arrays

  • Fixed issue with VGG16 non-pretrained configuration in model zoo

  • Fixed issue with TransferLearning nOutReplace where multiple layers in a row are modified

  • Fixed issue with CuDNN workspaces where backpropagation is performed outside of a standard fit call

  • Fixed an issue with dropout masks being cleared prematurely on output layers in ComputationGraph

  • RecordReaderMultiDataSetIterator now supports 5D arrays (for 3D CNNs)

  • Fixed bug in multi input/output ComputationGraphs with TBPTT combined with both masking and different number of input/output arrays

  • Improved input validation/exceptions for batch normalization layer

  • Fixed bug with TransferLearning GraphBuilder nOutReplace when combined with subsampling layers

  • SimpleRnnParamInitializer now properly respects bias initialization configuration

  • Fixed SqueezeNet zoo model non-pretrained configuration

  • Fixed Xception zoo model non-pretrained configuration

  • Fixed an issue with some evaluation signatures for multi-output ComputationGraphs

  • Improved MultiLayerNetwork/ComputationGraph summary method formatting for large nets

  • Fixed an issue where gradient normalization could result in NaNs if gradient is exactly 0.0 for all parameters in a layer

  • Fixed an issue where MultiLayerNetwork/ComputationGraph.setLearningRate could throw an exception for SGD and NoOp updaters

  • Fixed an issue with StackVertex plus masking in some rare cases

  • Fixed an issue with JSON deserialization of frozen layers in pre-1.0.0-alpha format

  • Fixed an issue where GraphBuilder.removeVertex can fail under some limited circumstances

  • Fixed a bug in CacheableExtractableDataSetFetcher

  • DL4J Spark training: Fixed issues with thread/device affinity for multi-GPU training + evaluation

  • DL4J Spark training: Made all Aeron threads daemon threads so that Aeron does not block JVM shutdown once all other threads have completed

  • Added cudnnAllowFallback configuration for BatchNormalization layer (fallback to built-in implementation if CuDNN fails unexpectedly); see the sketch at the end of this list

  • Fixed some rare concurrency issues with multi-worker (multi-GPU) nodes for Spark training

  • Fixed an issue with BatchNormalization layers that prevented the mean/variance estimates from being synced properly on each worker for GradientSharing training, causing convergence issues

  • Added a check to detect ZipSlip CVE attempts in ArchiveUtils

  • DL4J Spark training and evaluation: methods now use Hadoop Configuration from Spark context to ensure runtime-set configuration is available in Spark functions reading directly from remote storage (HDFS etc)

  • MultiLayerNetwork and ComputationGraph now properly support more than Integer.MAX_VALUE parameters

  • Added data validation for Nd4j.readTxt - now throws exception on invalid input instead of returning incorrect values

  • Fixed an issue with KNN implementation where a deadlock could occur if an invalid distance function (one returning "distances" less than 0) was utilized

  • Added synchronization to loading of Keras import models to avoid thread safety issues in the underlying HDF5 library used for loading

  • Fixed rare issue for Async(Multi)DataSetIterator with large prefetch values
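
For the minibatch gradient-scaling change at the top of this list, the pre-beta3 behaviour can be restored on newly created networks via the flag named in that item. A minimal sketch; the layer sizes, updater, and learning rate are illustrative only:

```java
import org.deeplearning4j.nn.conf.MultiLayerConfiguration;
import org.deeplearning4j.nn.conf.NeuralNetConfiguration;
import org.deeplearning4j.nn.conf.layers.OutputLayer;
import org.nd4j.linalg.activations.Activation;
import org.nd4j.linalg.learning.config.Adam;
import org.nd4j.linalg.lossfunctions.LossFunctions;

public class LegacyL2Example {
    public static void main(String[] args) {
        MultiLayerConfiguration conf = new NeuralNetConfiguration.Builder()
                .legacyBatchScaledL2(true)   // restore the pre-1.0.0-beta3 L1/L2 + updater ordering
                .updater(new Adam(1e-3))     // illustrative; learning rates may need retuning if this flag is NOT set
                .list()
                .layer(new OutputLayer.Builder(LossFunctions.LossFunction.MCXENT)
                        .nIn(10).nOut(3)
                        .activation(Activation.SOFTMAX)
                        .build())
                .build();
        System.out.println(conf.toJson());
    }
}
```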
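
The CuDNN fallback option for batch normalization mentioned above is set per layer, assuming it is exposed as a builder method of the same name:

```java
import org.deeplearning4j.nn.conf.layers.BatchNormalization;

public class BatchNormFallbackExample {
    public static void main(String[] args) {
        BatchNormalization bn = new BatchNormalization.Builder()
                .nOut(64)                     // illustrative layer size
                .cudnnAllowFallback(true)     // fall back to the built-in implementation if CuDNN fails unexpectedly
                .build();
        System.out.println(bn);
    }
}
```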

Deeplearning4J: API Changes (Transition Guide): 1.0.0-beta2 to 1.0.0-beta3

  • IEvaluation classes in DL4J have been deprecated and moved to ND4J so they are available for SameDiff training. Functionality and APIs are unchanged

  • MultiLayerConfiguration/ComputationGraphConfiguration pretrain(boolean) and backprop(boolean) have been deprecated and are no longer used. Use fit and pretrain/pretrainLayer methods instead (see the sketch after this list).

  • ParallelWrapper module no longer has a Scala version suffix for its artifact ID; the new artifact ID, deeplearning4j-parallel-wrapper, should be used instead

  • The deeplearning4j-nlp-korean module now has a Scala version suffix due to its Scala dependencies; the new artifact IDs are deeplearning4j-nlp-korean_2.10 and deeplearning4j-nlp-korean_2.11
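
For the pretrain(boolean)/backprop(boolean) deprecation above, layer-wise unsupervised pretraining and backprop fitting are now invoked directly on the network. A sketch under stated assumptions: 'net' is a MultiLayerNetwork whose layer 0 supports unsupervised pretraining (e.g. a variational autoencoder layer), and 'iter' is any DataSetIterator:

```java
import org.deeplearning4j.nn.multilayer.MultiLayerNetwork;
import org.nd4j.linalg.dataset.api.iterator.DataSetIterator;

public class PretrainExample {
    // Old style (removed): configuration-level .pretrain(true).backprop(true)
    // New style: call the relevant methods on the network itself.
    public static void train(MultiLayerNetwork net, DataSetIterator iter, int epochs) {
        net.pretrainLayer(0, iter);          // unsupervised pretraining of layer 0
        for (int i = 0; i < epochs; i++) {
            net.fit(iter);                   // standard supervised (backprop) training
        }
    }
}
```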

Deeplearning4J: Known issues: 1.0.0-beta3

  • Running multiple Spark training jobs simultaneously on one physical node (i.e., multiple JVMs from one or more Spark jobs) may cause problems with network communication. A workaround is to manually set a unique stream ID in the VoidConfiguration, using a unique (or random) integer value for different jobs, as shown in the sketch below.
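
A sketch of the workaround, assuming the stream ID is exposed on the VoidConfiguration builder as streamId; the actual value is arbitrary and only needs to differ between jobs:

```java
import org.nd4j.parameterserver.distributed.conf.VoidConfiguration;

public class UniqueStreamIdExample {
    public static void main(String[] args) {
        // Use a different value for each Spark training job running on the same physical nodes.
        VoidConfiguration voidConfig = VoidConfiguration.builder()
                .streamId(119)      // any unique/random positive int per job (illustrative value)
                .build();
        // Pass voidConfig to the SharedTrainingMaster builder as usual.
    }
}
```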

Deeplearning4J: Keras Import

ND4J

ND4J: New Features

ND4J: Bug Fixes and Optimizations

ND4J: API Changes (Transition Guide): 1.0.0-beta2 to 1.0.0-beta3

  • CUDA 8.0 support has been removed. CUDA 9.0, 9.2 and 10.0 support is available in 1.0.0-beta3

  • nd4j-base64 module contents have been deprecated; use the equivalent classes in nd4j-api from now on

  • Some classes in the nd4j-jackson module have been deprecated; use the equivalent classes in nd4j-api from now on

ND4J: Known issues: 1.0.0-beta3

  • Android users may need to manually exclude the (now deprecated) module nd4j-base64. This is due to the org.nd4j.serde.base64.Nd4jBase64 class being present in both the nd4j-api and nd4j-base64 modules. Both versions have identical content. Use exclude group: 'org.nd4j', module: 'nd4j-base64' to exclude it.

DataVec

DataVec: New Features

  • Added NativeImageLoader method overloads for org.opencv.core.Mat and for String (filename) inputs, as shown in the sketch below
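
A sketch of the new overloads; the height/width/channels constructor values are illustrative, and the asMatrix method name for both new argument types is an assumption based on the existing loader API:

```java
import org.datavec.image.loader.NativeImageLoader;
import org.nd4j.linalg.api.ndarray.INDArray;
import org.opencv.core.Mat;

public class ImageLoadingExample {
    public static INDArray[] load(String path, Mat alreadyDecodedImage) throws Exception {
        NativeImageLoader loader = new NativeImageLoader(224, 224, 3);   // height, width, channels (illustrative)

        INDArray fromPath = loader.asMatrix(path);                 // new String (filename) overload
        INDArray fromMat  = loader.asMatrix(alreadyDecodedImage);  // new org.opencv.core.Mat overload

        return new INDArray[]{fromPath, fromMat};
    }
}
```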

DataVec: Optimizations and Bug Fixes

  • Fix for JDBCRecordReader handling of null values

  • Improved errors/validation for ObjectDetectionRecordReader for invalid input (where image object centers are outside of image bounds)

  • Fixed an issue where FileSplit used methods that are unavailable on earlier versions of Android

  • Added SerializableHadoopConfiguration and BroadcastHadoopConfigHolder for cases where a Hadoop configuration is required in Spark functions

  • Fixed issue with JDBCRecordReader's handling of real-valued column result types

  • Added validation and useful exception for CSVRecordReader/LineRecordReader being used without initialization

Arbiter

Arbiter: Fixes

ND4S

  • Added conversion between org.nd4j.linalg.primitives.Pair/Triple and Scala Tuple
