| About Machine Learning |   | 0 | 951 | December 25, 2019 | 
        
          | Short videos are popular right now, and short-video image and text recognition is a hot topic |     | 1 | 907 | June 9, 2023 | 
        
          | Incorporating Hierarchy into Text Encoder: a Contrastive Learning Approach |   | 0 | 863 | December 25, 2022 | 
        
          | LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic |   | 0 | 797 | October 7, 2022 | 
        
          | Rethinking the Inception Architecture for Computer Vision |   | 0 | 1206 | April 14, 2022 | 
        
          | We Need to Talk About train-dev-test Splits |   | 0 | 931 | December 2, 2021 | 
        
          | Simple and Effective Positional Encoding |   | 0 | 1267 | November 27, 2021 | 
        
          | SELFEXPLAIN: A Self-Explaining Architecture for Neural Text Classifiers |   | 0 | 905 | November 20, 2021 | 
        
          | MTAdam: Automatic Balancing of Multiple Training Loss Terms |   | 0 | 887 | November 19, 2021 | 
        
          | What to Pre-Train on? Efficient Intermediate Task Selection |   | 0 | 791 | November 18, 2021 | 
        
          | Effects of Parameter Norm Growth During Transformer Training: Inductive Bias from Gradient Descent |   | 0 | 1005 | November 17, 2021 | 
        
          | Conditional Poisson Stochastic Beams |   | 0 | 913 | November 17, 2021 | 
        
          | Do Transformers Really Perform Bad for Graph Representation? |   | 0 | 1119 | October 9, 2021 | 
        
          | Curious: is there a Python package for reading and converting NLP task data? |     | 2 | 1147 | August 22, 2021 | 
        
          | PyTorch loss implementation of Su Jianlin's "Generalizing 'softmax + cross-entropy' to multi-label classification" |   | 0 | 2134 | August 22, 2021 | 
        
          | PyTorch loss implementation of Su Jianlin's "Mitigating class imbalance with the idea of mutual information" |   | 0 | 1602 | August 22, 2021 | 
        
          | Convolutions and Self-Attention: Re-interpreting Relative Positions in... |   | 0 | 834 | August 16, 2021 | 
        
          | Reservoir Transformers |   | 0 | 835 | August 15, 2021 | 
        
          | Structural Knowledge Distillation: Tractably Distilling Information for... |   | 0 | 1235 | August 10, 2021 | 
        
          | Parameter-efficient Multi-task Fine-tuning for Transformers via Shared... |   | 0 | 1017 | August 11, 2021 | 
        
          | Self-Training with Weak Supervision |   | 0 | 863 | July 30, 2021 | 
        
          | A Generative Model for Joint Natural Language Understanding and Generation |   | 0 | 929 | June 21, 2021 | 
        
          | Self-Attention Guided Copy Mechanism for Abstractive Summarization |   | 0 | 1289 | June 16, 2021 | 
        
          | NBDT: Neural-Backed Decision Trees |   | 0 | 1608 | June 10, 2021 | 
        
          | Active Imitation Learning with Noisy Guidance |   | 0 | 929 | May 27, 2021 | 
        
          | Integrating Semantic and Structural Information with Graph Convolutional... |   | 0 | 1018 | May 7, 2021 | 
        
          | Has anyone used Intel's second-generation Neural Compute Stick? |   | 0 | 914 | April 12, 2021 | 
        
          | TensorFlow 2.0 VS PyTorch? |           | 13 | 5780 | April 3, 2021 | 
        
          | Learning Sparse Neural Networks through L0 Regularization |   | 0 | 1401 | February 13, 2021 | 
        
          | Dice Loss for Data-imbalanced NLP Tasks |   | 0 | 1213 | February 18, 2021 |