Make Every feature Binary: A 135B parameter sparse neural network for massively improved search relevance
Recently, Transformer-based deep learning models like GPT-3 have been getting a lot of attention in the machine learning world. These models excel at understanding semantic relationships, and they have contributed to large improvements in Microsoft Bing’s search experience and surpassing human…