# Design Process
https://www.youtube.com/watch?v=g2vlqhefADk
1. Search for an existing architecture that fits the task
- transfer learning: take a model pre-trained on a large open dataset and fine-tune it
- if the task is a special case (something that has not shown up before), THEN move on to the steps below
2. Search for useful design patterns
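A minimal sketch of the transfer-learning idea above, using only the standard library. It assumes the simplest variant, where the pre-trained part is frozen and only a new head is trained. `BACKBONE_W`, the toy data, and all function names are made up for illustration; a real setup would load weights from a model trained on a large dataset.

```python
import math
import random

# Hypothetical "pre-trained" backbone weights (2 inputs -> 3 features).
# In practice these would be loaded from a model trained on a large dataset.
BACKBONE_W = [[0.8, -0.5], [0.3, 0.9], [-0.7, 0.4]]


def backbone(x):
    """Frozen feature extractor: its weights are never updated."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in BACKBONE_W]


def predict(head_w, head_b, x):
    """New task-specific head: a single linear unit on top of the features."""
    feats = backbone(x)
    return sum(w * f for w, f in zip(head_w, feats)) + head_b


def mse(head_w, head_b, data):
    """Mean squared error of the head on a list of (input, target) pairs."""
    return sum((predict(head_w, head_b, x) - y) ** 2 for x, y in data) / len(data)


def fine_tune_head(data, lr=0.1, epochs=200, seed=0):
    """Train only the head with plain SGD; the backbone stays frozen."""
    rng = random.Random(seed)
    head_w = [rng.uniform(-0.1, 0.1) for _ in range(3)]
    head_b = 0.0
    for _ in range(epochs):
        for x, y in data:
            feats = backbone(x)
            err = sum(w * f for w, f in zip(head_w, feats)) + head_b - y
            # Gradient of 0.5 * err**2 with respect to the head parameters only.
            head_w = [w - lr * err * f for w, f in zip(head_w, feats)]
            head_b -= lr * err
    return head_w, head_b


# Small made-up target task (target is x1 - x2).
data = [([0.0, 0.0], 0.0), ([1.0, 0.0], 1.0), ([0.0, 1.0], -1.0), ([1.0, 1.0], 0.0)]

w_before, b_before = fine_tune_head(data, epochs=0)  # untrained head
w_after, b_after = fine_tune_head(data)              # head after fine-tuning
print("MSE before:", mse(w_before, b_before, data))
print("MSE after: ", mse(w_after, b_after, data))
```

Full fine-tuning would also update the backbone, usually with a smaller learning rate; freezing it is just the cheapest starting point.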
(1) for CV
(2) for NLP
- attention, Transformer
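As a reminder of what the attention pattern computes, here is a small sketch of scaled dot-product attention in plain Python. The function names and the toy vectors are illustrative assumptions; real models use batched tensor code, multiple heads, and learned projections, none of which is shown here.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """For each query, return the softmax(q.k / sqrt(d_k))-weighted sum of values."""
    d_k = len(keys[0])
    n_out = len(values[0])
    outputs = []
    for q in queries:
        # Similarity of this query to every key, scaled by sqrt(d_k).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k) for k in keys]
        weights = softmax(scores)
        # Weighted average of the value vectors.
        out = [sum(w * v[j] for w, v in zip(weights, values)) for j in range(n_out)]
        outputs.append(out)
    return outputs


# Toy example: 2 queries attending over 3 key/value pairs.
queries = [[1.0, 0.0], [0.0, 1.0]]
keys = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
values = [[1.0, 0.0], [0.0, 1.0], [0.5, 0.5]]
outputs = scaled_dot_product_attention(queries, keys, values)
print(outputs)
```

Each output is pulled toward the values whose keys match the query best, which is the whole point of the pattern.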
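3. NAS (neural architecture search)
- At this stage NAS may still lose to hand-made designs. Some papers report that NAS-generated networks actually run much slower than Efficient Model even though they use fewer floating-point operations, possibly because NAS tends to produce too much network fragmentation
- Weight-agnostic network search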
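To make the "fewer operations but still slower" remark concrete, the sketch below compares two made-up blocks by multiply-accumulate count and by number of separate convolution ops. The layer shapes are hypothetical, and the script measures no latency; it only shows that a fragmented block can have a lower arithmetic cost while needing more separate ops, which is the suspected cause mentioned above.

```python
def conv_macs(h, w, c_in, c_out, k):
    """Multiply-accumulates of a stride-1, same-padded k x k convolution."""
    return h * w * c_in * c_out * k * k


# Hypothetical block A: one dense 3x3 convolution, 64 -> 64 channels on a 56x56 map.
block_a = [conv_macs(56, 56, 64, 64, 3)]

# Hypothetical block B: four parallel branches, each a 1x1 conv (64 -> 16)
# followed by a 3x3 conv (16 -> 16); branch outputs would be concatenated.
block_b = []
for _ in range(4):
    block_b.append(conv_macs(56, 56, 64, 16, 1))
    block_b.append(conv_macs(56, 56, 16, 16, 3))

for name, block in (("A (dense)", block_a), ("B (fragmented)", block_b)):
    print(f"{name}: {sum(block) / 1e6:.1f} M MACs in {len(block)} conv ops")
```

Block B does less arithmetic, but it is split into eight small ops instead of one large one, so on parallel hardware it may not be faster; only profiling on the target device can confirm that.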
4. Custom Model
Conv Model
(1) How to choose the number of layers and units?