My NIPS write up

You can connect with me on LinkedIn to discuss collaborations and work opportunities.

You can also follow me on Twitter, Bluesky and Mastodon.

Just as a quick disclaimer, this post is about my personal experience and opinions at NIPS 2017, and I'm not an AI researcher, I work as a data scientist in the industry. For a more technical summary of the talks and papers presented, you may want to check this document by David Abel.

Deep learning rigor and interpretability

This is quite a controversial topic, but this is how I see it. There are two main approaches to the idea of statistics/learning:

Understand how learning works, and replicate it based on this understanding
Focus on results, no matter if it's at the cost of poor understanding

I think these two approaches were first dividing statisticians and machine learning practitioners, as Leo Breiman describes in [The two cultures](http://www2.math.uu.se/~thulin/mm/breiman.pdf). And in a similar way, today it divides the Deep learning school, which is somehow winning in terms of results, from other techniques.

My view on deep learning is that we've managed to understand in a general way the how the human brain works. Not why, but with the research of people like Santiago Ramon y Cajal, Camilo Golgi, Donald Hebb..., we know that it's a network of neurons, and that the "intelligence" is on how the neurons connect, and not in the neurons themselves.

With the research of Warren McCulloch, Walter Pitts, John Hopfield, Geofreey Hinton..., we can replicate this structure of neurons in an artificial way. Just with a set of connected linear regressions, with activation functions to break the linearity. And with current computation power, including optimized hardware like GPUs, we can implement networks of neurons at a huge scale. We know that the model works, because it works for the human brain, and we're confident it's the same. But we don't know how each neuron is connected in the brain (how much signal it needs to receive from the other networks to activate), so we miss the weights of the linear regressions.

With techniques like backpropagation, stochastic gradient decent... we can optimize the weights to make useful things, like image or sound recognition and generation.

So, how I see it, the main question is:

Does it matter the rigor, how much we understand about what we do, how much we understand our models and their predictions? Or we just care about minimizing the out of sample error?

This may be a free interpretation of what was being discussed at NIPS, for example at [Ali Rahimi's talk](https://www.youtube.com/watch?v=Qi1Yry33TQE), or at the [interpretability debate](https://www.youtube.com/watch?v=2hW05ZfsUUo). It was interesting to see how excited people was about the debate, and the "celebrities" on the stage:

I think someone important was missing from the debate, and it's what Chris Olah and Shan Carter describe as [research debt](https://distill.pub/2017/research-debt/). Like in software, it's not only important what do you have today. It's important what will you have in the future. The best the internal quality of your software, the easier will be to improve it and add new features in the future. I think every good sofware engineer is aware of how important is to keep technical debt under control. But I don't think most researchers are aware that our understanding of the research today, is key for future research.

So, in my opinion, it's not that important that with deep learning we can have state of the art results in many areas. I don't think we'll have much better results in the future, unless we focus on quality research, and not just trying random things to get a small increase in the model accuracy.

GANs

I think Generative Adversarial Networks were by far the most popular topic at NIPS. I'm not sure how many talks [Ian Goodfellow](https://twitter.com/goodfellow_ian) gave, but it don't think it wasn't far from one every day. And it was all sort of applications of GANs, including many for creativity and design. We're not yet in the point of being able to generate arbitrary images with high definition, but it doesn't seem it'll take that long to have even more impressive results than what we've already seen. One of the most discussed articles was the [GAN that generates celebrity faces](http://research.nvidia.com/publication/2017-10_Progressive-Growing-of).

Bayesian statistics

Bayesian statistics was also very present during the whole NIPS. Many times together with deep learning, like in the [Bayesian deep learning and deep Bayesian learning](https://www.youtube.com/watch?v=LVBvJsTr3rg) talk, the [Bayesian deep learning workshop](http://bayesiandeeplearning.org/), or the [Bayesian GAN paper](https://arxiv.org/abs/1705.09558). Gaussian processes and Bayesian optimization was also present from the tutorials, to the workshops.

Surprisingly to me, most of the papers presented about multi-armed bandit problems were based on frequentist statistics. And I say surprisingly, because I think the industry is mostly adopting Bayesian methods for A/B testing, one of the main applications. In my opinion Bayesian methods are much simpler and intuitive, and tend to offer better results. One of the hot topics in this area is lowering the false discovery rate in repeated tests. And many paper about contextual bandits were also presented, and are that I discovered at NIPS.

Reinforcement learning

RL was the last of the main topics that kept repeating during the whole NIPS, if I'm not missing any. Both based on the classic q-learning, or by using deep learning representations.

About the conference

It was the first time for me attending an academic conference, and some things weren't very intuitive, being used to open source of business conferences. This is a random list with my thoughts:

I found the location quite good:

Near to a main airport, so I could fly directly from London
Good temperature
Many hotels nearby
English speaking country
The only problem with the location was that people from several countries (e.g. Iran) were banned from attending, as the organizers mentioned in the home page of the conference

I found the use of an app to communicate during the conference quite convenient. Even if the app had some obvious flaws, like the mess with the list of discussions, it added a lot of value

I found it difficult to know what to expect about food. I think in all previous conference I attended (and they are not few), breakfast and lunch was provided. At #NIPS it was advertised in the schedule that breakfast wasn't offered first time in the morning, no other mention. Then, breakfast was provided later in the morning (one day the ![](https://2.bp.blogspot.com/-sytt8-nPHl8/WjWYACiZGuI/AAAAAAAAymo/FMSCxkkEsTgxLw20XjnvuJH6iJyR9Ux2gCLcBGAs/s320/IMG_20171205_112058.jpg)

Compared to open source conferences, I found the atmosphere at NIPS very different. May be it's by the nature of research and open source, but my experience is that open source conferences have a very collaborative environment. You don't necessarily need to like or use someone else's project, to have a friendly discussion or appreciate his contribution. But I felt research quite a competitive environment. More than once I saw people in presentations or posters addressing the presenter in a not very nice way. Challenging their research, trying to point out that they know better. I think providing constructive feedback is always great, but I found sad this feeling of mine (that may be biased by just the few examples I saw) that researchers see each others more as rivals, than as part of a community that delivers together.

My NIPS write up

Deep learning rigor and interpretability

GANs

Bayesian statistics

Reinforcement learning

Other topics

About the conference

Systems

A sad story