New Architectures for Very Deep Learning

Staff - Faculty of Informatics

Date: 1 February 2018 / 14:30 - 16:00

USI Lugano Campus, room SI-006, Informatics building (Via G. Buffi 13)

You are cordially invited to attend the PhD Dissertation Defense of Rupesh Kumar SRIVASTAVA on Thursday, 1 February 2018, at 14:30 in room SI-006 (Informatics building).

 

Abstract:

Artificial Neural Networks are increasingly being used in complex real-world applications because many-layered (i.e., deep) architectures can now be trained on large quantities of data. However, training even deeper, and therefore more powerful, networks has hit a barrier due to fundamental limitations in the design of existing networks. This thesis develops new architectures that, for the first time, allow very deep networks to be optimized efficiently and reliably. Specifically, it addresses two key issues that hamper credit assignment in neural networks: cross-pattern interference and vanishing gradients.

Cross-pattern interference leads to oscillations of the network's weights that make training inefficient. The proposed Local Winner-Take-All networks reduce interference among computation units in the same layer through local competition. An in-depth analysis of locally competitive networks provides generalizable insights and reveals unifying properties that improve credit assignment.
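As a rough illustration of local competition, the sketch below splits a layer's activations into fixed-size blocks and lets only the largest activation in each block pass, setting the others to zero. The function name `lwta`, the `block_size` parameter, and the tie-breaking rule (first unit wins) are assumptions made for this example and are not taken from the announcement.

```python
def lwta(activations, block_size):
    """Local winner-take-all over consecutive blocks of units.

    Within each block of `block_size` units, the unit with the largest
    activation keeps its value; every other unit in the block outputs 0.0.
    Ties are broken in favour of the first unit (an assumption of this sketch).
    """
    if block_size <= 0 or len(activations) % block_size != 0:
        raise ValueError("layer width must be a positive multiple of block_size")

    output = []
    for start in range(0, len(activations), block_size):
        block = activations[start:start + block_size]
        # Index of the winning unit inside this block.
        winner = max(range(block_size), key=lambda i: block[i])
        output.extend(v if i == winner else 0.0 for i, v in enumerate(block))
    return output


# Four units in two blocks of two: one unit per block stays active.
result = lwta([0.2, 0.9, -0.5, -0.1], block_size=2)
```

Because only the winner in a block is active for a given input, only its weights receive a training signal for that input, which is one way to see why units in the same layer interfere less with one another.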

As network depth increases, vanishing gradients make a network's outputs increasingly insensitive to the weights close to the inputs, causing gradient-based training to fail. To overcome this limitation, the proposed Highway networks regulate information flow across layers through additional skip connections that are modulated by learned computation units. Their beneficial properties are extended to the sequential domain with Recurrent Highway Networks, which benefit from increased depth and learn complex sequential transitions without requiring more parameters.
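A minimal sketch of a single highway layer may help make the gated skip connection concrete. It assumes the commonly published formulation y = T(x) * H(x) + (1 - T(x)) * x, with a tanh transform H and a sigmoid gate T; the abstract itself does not spell out these choices, and the helper names (`affine`, `highway_layer`) are illustrative only.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def affine(weights, bias, x):
    """Compute weights @ x + bias for a list-of-rows weight matrix."""
    return [sum(w * xi for w, xi in zip(row, x)) + b
            for row, b in zip(weights, bias)]


def highway_layer(x, w_h, b_h, w_t, b_t):
    """One highway layer: y = T(x) * H(x) + (1 - T(x)) * x.

    H is the usual nonlinear transform; T is a learned gate in (0, 1) that
    decides, per unit, how much of the transform versus the unchanged input
    is passed on. Input and output must have the same width.
    """
    h = [math.tanh(z) for z in affine(w_h, b_h, x)]   # transform H(x)
    t = [sigmoid(z) for z in affine(w_t, b_t, x)]     # gate T(x)
    return [ti * hi + (1.0 - ti) * xi for ti, hi, xi in zip(t, h, x)]


# With a strongly negative gate bias the layer is close to the identity,
# so the input (and its gradient) passes through almost unchanged.
zeros = [[0.0, 0.0], [0.0, 0.0]]
x = [1.0, -2.0]
y = highway_layer(x, zeros, [0.0, 0.0], zeros, [-50.0, -50.0])
```

Initialising the gate bias to a negative value, as in the usage lines above, biases each layer toward carrying its input forward, which is one reason stacks of such layers remain trainable at large depth.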

 

Dissertation Committee:

  • Prof. Jürgen Schmidhuber, Università della Svizzera italiana/IDSIA, Switzerland (Research Advisor)
  • Prof. Michael Bronstein, Università della Svizzera italiana, Switzerland (Internal Member)
  • Prof. Antonio Carzaniga, Università della Svizzera italiana, Switzerland (Internal Member)
  • Prof. Sepp Hochreiter, Johannes Kepler University Linz, Austria (External Member)
  • Prof. Ruslan Salakhutdinov, Carnegie Mellon University, USA (External Member)
