
An early overview of ICLR2018

Most reviews for ICLR2018 are out and, given the success of the past edition, it is time for the next one :)

Update: I have published an overview of the decision process in part 2, and final decisions in part 3.

The sixth edition of ICLR will be held in Vancouver on the 30th of April, and it comes with some important changes:

  • This time submissions are double-blind.
  • There is only one review round.
  • Revisions are not allowed during the review period (last year some authors submitted “a draft” before the deadline and then kept working on it until the reviews).

Another difference from the last edition is that review scores have dropped from a median of 6 to a median of 5 (5.69 avg -> 5.23 avg):

Could this be a consequence of the single review round, which helped authors empathize with their reviewers? Are the low-hanging fruits running out? Is it the double-blind reviewing?

What about the cut-off score for acceptance? Taking the best 40% of the 979 mean scores, it is 5.54.

I computed each paper's score by averaging its review ratings weighted by reviewer confidence:

```python
papers = [{"r": 9, "c": 3}, {"r": 4, "c": 1}]               # ratings r with confidences c
total_c = sum(p["c"] for p in papers)
avg_score = sum(p["r"] * p["c"] for p in papers) / total_c
# avg_score = 9 * (3 / (3 + 1)) + 4 * (1 / (3 + 1)) = 7.75
```
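
For reference, here is a minimal sketch of how that 40% cut-off could be reproduced, assuming a list with the 979 confidence-weighted means computed as above (the variable name and the use of NumPy are mine, not the original analysis):

```python
import numpy as np

scores = [7.75, 5.0, 6.3, 4.2]          # placeholder; in practice, the 979 weighted means
cutoff = np.percentile(scores, 60)      # the top 40% of papers lie above the 60th percentile
```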

Anyway, let's go straight to the point and look at the top 10 rated papers.

Top 10 rated papers

| # | Title | Authors | Rating | Std |
|---|-------|---------|--------|-----|
| 1 | Certifiable Distributional Robustness with Principled Adversarial Training | Anon | 9.0 | 0.0 |
| 2 | On the Convergence of Adam and Beyond | Anon | 8.6 | 0.5 |
| 3 | Deep Mean Field Games for Learning Optimal Behavior Policy of Large Populations | Anon | 8.5 | 1.2 |
| 4 | Multi-Scale Dense Networks for Resource Efficient Image Classification | Anon | 8.3 | 1.2 |
| 5 | Emergence of grid-like representations by training recurrent neural networks to perform spatial localization | Anon | 8.3 | 0.5 |
| 6 | i-RevNet: Deep Invertible Networks | Anon | 8.1 | 0.8 |
| 7 | Spherical CNNs | Anon | 8.1 | 0.8 |
| 8 | Wasserstein Auto-Encoders | Anon | 8.0 | 0.0 |
| 9 | Learning to Represent Programs with Graphs | Anon | 8.0 | 0.0 |
| 10 | Boosting Dilated Convolutional Networks with Mixed Tensor Decompositions | Anon | 8.0 | 0.8 |

I recommend the reader go through all of them; they are all inspiring and interesting. It can also be seen that reviewers are unanimous on most of them. This brings up another question: which submissions create the most discord?

| # | Title | Authors | Std | Min | Max |
|---|-------|---------|-----|-----|-----|
| 1 | Progressive Growing of GANs for Improved Quality, Stability, and Variation | Anon | 3.3 | 1 | 8 |
| 2 | BLOCK-NORMALIZED GRADIENT METHOD: AN EMPIRICAL STUDY FOR TRAINING DEEP NEURAL NETWORK | Anon | 3.0 | 3 | 9 |
| 3 | Stabilizing Adversarial Nets with Prediction Methods | Anon | 2.5 | 3 | 9 |
| 4 | Emergent Complexity via Multi-Agent Competition | Anon | 2.5 | 3 | 9 |
| 5 | Multi-Agent Compositional Communication Learning from Raw Visual Input | Anon | 2.4 | 3 | 9 |

The most extreme case is Progressive Growing of GANs for Improved Quality, Stability, and Variation, which one reviewer rates as trivial or wrong while another rates it an 8 (top 50%).
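
For reference, a minimal sketch of how the disagreement statistics in the table above could be computed, assuming each paper's ratings are available as a plain list (whether the original analysis uses the population or sample standard deviation is not stated; population is used here):

```python
import statistics

def disagreement(ratings):
    """Return the std, min and max of a paper's review ratings."""
    return statistics.pstdev(ratings), min(ratings), max(ratings)

print(disagreement([3, 9, 6]))  # hypothetical ratings -> (2.449..., 3, 9)
```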

A great feature of OpenReview is that it allows public comments. These can help us discover other interesting papers, such as the most popular ones. However, comments do not always come from experts, so it is difficult to tell whether they are relevant for assessing the importance of a paper. In order to shed some light on the importance of comments vs. reviews, I have compared a simple sentiment analysis score on the comments with the ratings given by the reviewers:

As can be seen, there is not much correlation between the sentiment of user comments and the reviewers' ratings (the size of the blue dots is the number of comments). Another factor is that the tool used for sentiment analysis might be too simple, although it does succeed for some of them:

(+) Deep Learning is Robust to Massive Label Noise: Good results, better than in literature.

(-) Hybed: Hyperbolic Neural Graph Embedding: These embeddings are not in the hyperbolic space as claimed…
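
For those curious, here is a minimal sketch of how such a comment-sentiment vs. reviewer-rating comparison could be set up. The post does not say which sentiment tool was used; NLTK's VADER analyzer is only an illustrative stand-in, and the data structure is assumed:

```python
import nltk
from nltk.sentiment.vader import SentimentIntensityAnalyzer

nltk.download("vader_lexicon")  # one-off download of the VADER lexicon
analyzer = SentimentIntensityAnalyzer()

def comment_sentiment(comments):
    """Average compound sentiment (-1 to 1) over a paper's public comments."""
    scores = [analyzer.polarity_scores(text)["compound"] for text in comments]
    return sum(scores) / len(scores) if scores else 0.0

# Hypothetical usage: one point per paper (x = sentiment, y = rating, size = #comments)
# points = [(comment_sentiment(p["comments"]), p["avg_rating"], len(p["comments"]))
#           for p in papers]
```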

Also, we don't know how expert the commenting users are. However, does confidence change the way reviewers rate?

Do confident reviewers rate differently?

The answer is yes, as can be seen in the following plot:

Confident reviewers tend to give more extreme ratings (0-3, 7-8, 9-10), while less confident ones tend to vote in the middle (4-6).
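
A minimal sketch of how this can be checked, assuming the reviews are available as (rating, confidence) pairs (the structure and the "extreme" threshold below are my assumptions):

```python
from collections import defaultdict

reviews = [(9, 5), (5, 2), (2, 4), (6, 3)]   # placeholder (rating, confidence) pairs

ratings_by_confidence = defaultdict(list)
for rating, confidence in reviews:
    ratings_by_confidence[confidence].append(rating)

for confidence, ratings in sorted(ratings_by_confidence.items()):
    extreme = sum(r <= 3 or r >= 7 for r in ratings) / len(ratings)
    print(f"confidence {confidence}: {extreme:.0%} extreme ratings (<=3 or >=7)")
```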

About the data

This time I have used the OpenReview API instead of a crawler. I will update the data regularly until all the reviews are published.
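
For reference, a minimal sketch of pulling the data through the OpenReview REST API with `requests`; the invitation strings (and the wildcard for reviews) are assumptions based on how OpenReview usually names venues, not necessarily the exact queries used here:

```python
import requests

API = "https://api.openreview.net/notes"

def get_notes(invitation, limit=1000):
    """Fetch all notes matching an invitation, paging through the results."""
    notes, offset = [], 0
    while True:
        resp = requests.get(API, params={"invitation": invitation,
                                         "limit": limit, "offset": offset})
        batch = resp.json().get("notes", [])
        notes.extend(batch)
        if len(batch) < limit:
            return notes
        offset += limit

submissions = get_notes("ICLR.cc/2018/Conference/-/Blind_Submission")
reviews = get_notes("ICLR.cc/2018/Conference/-/Paper.*/Official_Review")
```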

Related Posts

Go to part 2, 3