Commit d7f4374

light refactor
1 parent bd3ff42 commit d7f4374

6 files changed: +18 -11 lines changed

notes.md

Lines changed: 17 additions & 10 deletions
@@ -46,11 +46,12 @@ protip, use perplexity.ai as a resource scrapper
 - [ ] wait, should I try to train LSTM like I did for Transformers?
 - They dont support parallel computation, but recently **xLSTM** dropped which does.
 - [ ] Seq2Seq (Ilya, 2014)
-- [ ] JukeBox
+- [ ] JukeBox- openai
 - [ ] Mixture of experts (MoE) [This?](https://arxiv.org/pdf/1701.06538)
-- [ ] LLaMA
+- [ ] LLaMA- metaai
 - [ ] Switch Transformers
 - [ ] Multi-modality
+- [ ] https://arxiv.org/pdf/2405.09818v1
 - [ ] Beam Search?
 - [ ] RAG
 - [ ] Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention
@@ -77,19 +78,25 @@ protip, use perplexity.ai as a resource scrapper
 - WizardMath?


+## Training
+- [ ] LoRA: https://arxiv.org/abs/2106.09685
+

 ## Vision
-- [ ] CNN Casestudy:
-- [ ] CNN - {Le - Alex - ZF - VGG - GoogLe}Net (inception architecture)
+- [x] CNN Casestudy:
+- [x] CNN - {Le - Alex - ZF - VGG - Google}Net
+- TODO: (inception architecture)
+- [ ] ResNet (residual and skip connection, research paper)
 - [ ] Visualizing CNN techniques
 - [ ] DeepDream?
 - [ ] Localization and Segmentation (cs231n)
-- [ ] Fast Faster Fastest?? R? - CNN's
-- [ ] ResNet (residual and skip connection, research paper)
-- [ ] yolo
+- [ ] R-CNN
+- [ ] Fast R-CNN
+- [ ] Faster R-CNN
+- [ ] YOLO: you only look once
+- [ ] SSD
 - [ ] CLIP-ResNet (read somewhere kinda interesting, mostprobably best ResNet till date? not sure)
 - [ ] train something on COCO dataset? A good task?
-- [ ] SSD?



@@ -105,10 +112,10 @@ protip, use perplexity.ai as a resource scrapper

 ## Reinforcement learning
 - [ ] RF
-- [ ] RoPE (it goes here? dont know.)
 - [ ] DQN
 - [ ] Policy Gradient Methods
 - [ ] DPO
+- [ ] RoPE (it goes here? dont know.)



@@ -160,4 +167,4 @@ protip, use perplexity.ai as a resource scrapper


 ## Some resourceful repos
-- [ ] https://github.com/coqui-ai/TTS?tab=readme-ov-file#model-implementations
+- [ ] https://github.com/coqui-ai/TTS?tab=readme-ov-file#model-implementations

gemm.py renamed to scripts/gemm.py

File renamed without changes.

transformer/transformer_pytorch.ipynb renamed to transformer_pytorch.ipynb

Lines changed: 1 addition & 1 deletion
@@ -28,7 +28,7 @@
 "metadata": {},
 "outputs": [],
 "source": [
-"with open('input.txt', 'r', encoding='utf-8') as f:\n",
+"with open('./data/input.txt', 'r', encoding='utf-8') as f:\n",
 " text = f.read()\n",
 "\n",
 "chars = sorted(list(set(text)))\n",
File renamed without changes.
