expired Posted by SehoneyDP • Mar 15, 2024 4:09 PM
NVIDIA GeForce RTX 3090 Founders Edition Dual Fan 24GB GDDR6X GPU Card (Refurb)
(Select Stores) + Free Store Pickup - $700
Micro Center
148 Comments
slimdunkin117 (post rated helpful by the community), replying to:
I can play with GPT to see what it spits out - I am curious to hear from someone that has done it.
Once you get things working, you should check out oobabooga's UI. It's really popular in the community: https://github.com/oobabooga/text...tion-webui
It allows you to do all sorts of things, from trying different models and adjusting parameters to even fine-tuning.
Check out /r/localllama on reddit.
Hardware minimums are pretty low to run a 7B-parameter LLM, but they ramp up substantially if you want to run a 30B or 60B-parameter model and get more than a couple of tokens/s.
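To put rough numbers on that, here's a back-of-the-envelope sketch (my own assumptions: the weights dominate VRAM use, plus roughly 20% overhead for the KV cache and activations):

```python
# Rough VRAM estimate for running a local LLM at various quantization levels.
# Assumption (mine): memory = parameters x bytes-per-parameter, plus ~20%
# overhead for the KV cache and activations.

def vram_needed_gb(params_billion: float, bytes_per_param: float,
                   overhead: float = 1.2) -> float:
    """Approximate GB of VRAM to hold the weights plus runtime overhead."""
    return params_billion * bytes_per_param * overhead

for params in (7, 13, 30, 60):
    for label, bpp in (("fp16", 2.0), ("8-bit", 1.0), ("4-bit", 0.5)):
        print(f"{params}B @ {label}: ~{vram_needed_gb(params, bpp):.1f} GB")
```

By this estimate a 4-bit 7B model fits in ~4 GB, a 4-bit 30B model needs ~18 GB (hence 24 GB cards like this 3090), and a 60B model won't fit on a single 3090 without offloading.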
It's super efficient at idle, consuming only 3 W when not connected to any monitor.
HappyAccident (post rated helpful by the community):
If you know what is going on inside a transformer model (I barely have a rough idea), it computes by comparing different possibilities against its set of weights in order to ultimately find a response that fits within its constraints (the parameters you set when you load it -- not sure how LM Studio does it, but if you see a 'temperature' slider, that set of options is what I am talking about). To do this it has to run through millions of operations over and over, layer after layer, until it pops out a token. That requires vast amounts of data to be read constantly, and thus memory bandwidth is the key limiting factor for operation.
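To make the 'temperature' part concrete, here's a minimal sketch of temperature sampling (my own illustration, not LM Studio's actual code):

```python
import math
import random

def sample_with_temperature(logits: dict[str, float], temperature: float) -> str:
    """Scale raw model scores by temperature, softmax them, and sample a token."""
    # Lower temperature sharpens the distribution (more deterministic output);
    # higher temperature flattens it (more varied/creative output).
    scaled = {tok: score / temperature for tok, score in logits.items()}
    max_score = max(scaled.values())  # subtract max for numerical stability
    exps = {tok: math.exp(s - max_score) for tok, s in scaled.items()}
    total = sum(exps.values())
    probs = {tok: e / total for tok, e in exps.items()}
    return random.choices(list(probs), weights=list(probs.values()))[0]

fake_logits = {"cat": 2.0, "dog": 1.5, "fish": 0.2}  # made-up scores
print(sample_with_temperature(fake_logits, temperature=0.7))  # usually "cat"
print(sample_with_temperature(fake_logits, temperature=2.0))  # more varied
```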
Hope I didn't get too much wrong with that description.
This is all related only to inference, by the way (having the models compute responses), not training (teaching the models how to compute responses), which has similar constraints but is a different process and can rely on different factors.
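A rough way to see the bandwidth limit in numbers (a rule-of-thumb sketch using the 3090's spec-sheet bandwidth, not measured benchmarks):

```python
# For each generated token, the GPU streams essentially all of the model's
# weights through its memory bus, so an upper bound on single-batch speed is
# roughly memory bandwidth divided by model size.

def max_tokens_per_sec(bandwidth_gb_s: float, model_size_gb: float) -> float:
    """Bandwidth-bound ceiling on tokens/s for single-batch inference."""
    return bandwidth_gb_s / model_size_gb

BW_3090 = 936  # GB/s, RTX 3090 GDDR6X spec-sheet bandwidth
print(max_tokens_per_sec(BW_3090, 13 * 0.5))  # 13B @ 4-bit: ~144 tok/s ceiling
print(max_tokens_per_sec(BW_3090, 30 * 0.5))  # 30B @ 4-bit: ~62 tok/s ceiling
```

Real-world speeds come in well below these ceilings, but it shows why bandwidth, not raw compute, is usually the bottleneck for local inference.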