Author: Allan Witt | Page 6

Allan Witt is the co-founder and Editor-in-Chief of Hardware-Corner.net. Computers and the web have fascinated him since childhood. In 2011, he began training as an IT specialist at a mid-sized company while launching a tech blog on the side—quickly discovering a passion for writing about hardware and technology.

After completing his training, Allan worked as a system administrator for two years. Alongside that, he started building and upgrading custom gaming PCs at a local hardware shop. What began as a part-time project grew into a full-time career. Today, his work also focuses on building and optimizing PC systems for local AI and LLM workloads, combining hands-on experience with a passion for making complex tech easy to understand.

Mar. 25, 2025 / LLM Hardware News

How Fast is Mac Studio M3 Ultra Running the New DeepSeek V3 LLM?

The benchmarks were conducted using MLX and DeepSeek-V3-0324 in 4-bit quantization on a Mac Studio with an M3 Ultra. At 16K tokens, memory usage peaked at 466GB, demonstrating Apple’s advantage in handling large models locally.
Mar. 23, 2025 / LLM Hardware News

96GB VRAM LLM PC Build Under $10K? RTX Pro 6000 vs. Dual 48GB 4090-Which Wins?

While NVIDIA’s newly announced RTX Pro 6000 offers a straightforward 96GB VRAM solution, , a new wave of modified RTX 4090 from China - offering 48GB per card - has emerged as a potential alternative.
Mar. 21, 2025 / LLM Hardware News

NVIDIA Just Made the RTX 5090 Look Weak for LLM – Meet the RTX Pro 6000!

The RTX Pro 6000, has arrived with a spec sheet that firmly cements it as a Titan-class card. With its high core count, extensive memory capacity, the RTX Pro 6000 stands as a serious competitor to the RTX 5090
Mar. 20, 2025 / LLM Hardware News

RTX PRO 6000 GPU Can Run 70B LLM Models – But There’s a Catch!

With 96GB of GDDR7 memory, 1.79 TB/s memory bandwidth, RTX PRO 6000 is the first single-card workstation GPU capable of fully loading an 8-bit quantized 70B model such as LLaMA 3.3.
Mar. 19, 2025 / LLM Hardware News

$3,000 for THIS? NVIDIA’s DGX Spark Faces Tough Competition

With a starting price of $2,999 (though the Founders Edition is listed at $3,999), the DGX Spark aims to democratize access to high-memory compute for local AI inference.
Mar. 18, 2025 / LLM Hardware News

The First Mini-PC to Run 70B LLMs Locally: GMK EVO-X2 Unveiled

The GMK EVO-X2, which was recently showcased at AMD’s "ADVANCING AI" Summit, is designed to meet this need, packing impressive AI processing capabilities into a small form factor.
Mar. 17, 2025 / LLM Hardware News

How Fast Will a Ryzen AI MAX+ 395 (Strix Halo) System Be for LLM Inference?

AMD’s Ryzen AI MAX+ 395 (Strix Halo) brings a unique approach to local AI inference, offering a massive memory allocation advantage over traditional desktop GPUs like the RTX 3090, 4090, or even the upcoming 5090..
Mar. 15, 2025 / LLM Hardware News

$10,000 Mac Studio Destroys NVIDIA in AI Battle?! Run Massive LLMs at HOME!

The world of local AI has just been flipped on its head, and you won’t BELIEVE which tech giant is leading the charge! Forget cramming multiple power-hungry NVIDIA GPUs into your rig just to touch the edge of massive language models. Apple’s brand new Mac Studio with the M3 Ultra chip is here, and it’s...
Feb. 20, 2025 / LLM Hardware News

NVIDIA Limits RTX 50 Sales: Impact on Local LLM & AI Enthusiasts

NVIDIA appears to be taking steps to manage the supply of its next-generation GPUs, potentially in response to continued shortages and pricing concerns.
Sep. 15, 2024 / Desktops computers

How to Identifying Your Graphics Card

In the ever-evolving landscape of PC hardware, the graphics processing unit (GPU) stands as a cornerstone of system performance, particularly for gaming and content creation. However, many users find themselves at a loss when it comes to identifying the exact model residing in their machine. We’ll begin with the most straightforward method, leveraging built-in operating...
Sep. 15, 2024 / Help and FAQ

Can Llama 3.1 Generate and Interpret Images?

No, Llama 3.1 cannot directly generate or interpret (Vision model) images in its current official open weights versions (8B , 70B, and 405B). The Llama 3.1 models released by Meta are primarily language models designed for tasks like text generation, reasoning, and classification with advanced language understanding capabilities. These models have features such as multilinguality,...
Jan. 10, 2024 / Desktops computers

How to Increase the VRAM of Your Mac with Apple Silicone for LLMs?

It is surprisingly straightforward to increase the VRAM of your Mac (Apple Silicone M1/M2/M3 chips) computer and use it to load large language models. Here’s the rundown of my experiments. For 32 GB RAM Mac model I recently experimented with a 32GB MacBook Pro equipped with an M1 Pro chip, attempting to load a 24GB...
Jan. 3, 2024 / General topics

Why Install and Host Large Langue Model on your Computer

I saw questions on couple of places about hosting your own LLM and thought I’d chime in with my two cents. Hosting your own Large Language Model (LLM) is pretty cool for a couple of reasons. Privacy is a big one for many folks. If you’re working with sensitive data, you definitely don’t want to...
Dec. 2, 2023 / LLM Hardware News

Maximize Efficiency in LLM Training: New Software Offers 80% Speed Increase with 50% Less Memory

The new AI startup Unsloth has unveiled its latest product, targeted toward the field of Large Language Model (LLM) training. Their flagship software, promises an astounding 30x faster training speed for LLMs, with a substantial 60% reduction in memory usage, and no compromise on accuracy. This method is set to transform how AI models are...
Nov. 4, 2023 / Desktops computers

What Is The Right LLM For RTX 3060 (12GB)?

For those looking to run large language models like Llama-2 and Mistral, on a PC built with RTX 3060 (12GB VRAM), here’s a concise guide on what model you can run. For the RTX 3060 (12GB), you can utilize any GPTQ 7B model. Models in GGUF format can be used with up to 8-bit (Q8)...
Jun. 7, 2023 / Help and FAQ

What is GGML Tensor library?

GGML is a C library that enables you to perform fast and flexible tensor operations and machine learning tasks. Currently, the combination between GGML and llama.cpp is the best option for running LLaMa based model like Alpaca, Vicuna, or Wizard on your personal computer’s CPU. You can use GGML converted weights (GGML or GGUF file...
May. 24, 2023 / Desktops computers

Where is the best place for a PC? Above or below the desk?

If you are a desktop PC user, you may have wondered where to place your PC: under or over your desk. Both options have their pros and cons, and the best choice depends on your personal preferences, needs and situation. I have personally tested both options and these are some of the factors I consider...
May. 22, 2023 / Desktops computers

Switching to PC Gaming: Is it Worth it Nowadays?

I wanted to share my thoughts on PC gaming and why I think it’s a great choice for gaming enthusiasts like us. So, let’s dive in and discuss the reasons why switching to PC gaming is totally worth it! Better Graphics Quality When it comes to graphics, PC gaming takes the crown. Gaming PCs are...
Feb. 13, 2023 / Desktops computers

Best CPU for Dell 0pgkwf motherboard

The best CPU you can install on a Dell 0PGKWF motherboard is Intel’s 2nd Gen Core i7-2700K (Sandy Bridge) processor. The 0PGKWF motherboard is used in OptiPlex 990 Ultra Small Form Factor. According to the spec sheet, 990 USFF is using Q67 chipset and supports only 2nd Generation Intel Core i7, i5, i3 processors. Keep...
Jan. 14, 2023 / Help and FAQ

What is ExpressConnect DBWM (ECDBWM.exe)

ExpressConnect (ecdbmw.exe process) is a software application that works in conjunction with Dell Optimizer to optimize the network performance of your system. It offers a client-centric approach to network optimization. If you see the process “ecdbmw.exe” in your Task Manager, it indicates that Dell Optimizer has been installed on your system. Usually the software comes...