Replies: 2 comments 6 replies
-
|
mmm I missed this other discussion which might already give me a sort of answer to my problem... Just keeping this alive in case anyone wants to chime in 😺 |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
Hi there. Could you share what operating system this is on, and whether you are running this on cpu/mps/cuda devices? |
Beta Was this translation helpful? Give feedback.
6 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
First of all, I think the book is great, so, chapeau to you, seriously! :)
I have just finished chapter 5 and realized that my output are slightly different than the ones in the book.
My
For instance, in the book, the very last generated sentence is:
In my code, the resulting sentence is the following instead:
I also notices in the repository notebook that a variable not present in the book (
inference_device) was introduced to "ensure readers get same results in the remaining sections of this book". I tried to add that to my code but the result did not match that of the book still... (the one I pasted above).Did anyone else experience the same? My code does not throw any error and the result is still somewhat meaningful, so I wonder if wither I did something wrong or if this difference is expected. I would want to make sure of this before proceeding to chapter 6. Here is the repository where I am keeping track of my progress (together with the version of the various packages in
requirements.txtwhich I am using, which is the same of the book except for torch (2.4.1 instead of 2.4.0) because I had some issues with that.https://github.com/umbe1987/build-LLM-from-scratch
Beta Was this translation helpful? Give feedback.
All reactions