ChatGPT's new browser has potential, if you're willing to pay

themachinestops@lemmy.dbzer0.com · edit-2 8 days ago

MagicShel@lemmy.zip · 7 days ago

I’ll look into it. OAI’s 30B model is the most I can run on my MacBook, and it’s decent. I don’t think I can even run that on my desktop with a 3060 GPU. I have access to GLM 4.6 through a service, but that’s the ~350B parameter model and I’m pretty sure that’s not what you’re running at home.

It’s pretty reasonable in capability. I want to play around with setting up RAG pipelines for specific domain knowledge, but I’m just getting started.

brucethemoose@lemmy.world · edit-2 7 days ago

I have access to GLM 4.6 through a service but that’s the ~350B parameter model and I’m pretty sure that’s not what you’re running at home.

It is. I’m running this model, with hybrid CPU+GPU inference, specifically: https://huggingface.co/Downtown-Case/GLM-4.6-128GB-RAM-IK-GGUF

You can likely run GLM Air on your 3060 desktop if you have 48GB+ RAM, or a smaller MoE easily. Heck, I’ll make a quant just for you, if you want.
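For a rough sense of why 48GB+ RAM is the threshold, here is a back-of-the-envelope sizing sketch. Everything numeric in it is an assumption rather than something stated in this thread: roughly 106B total parameters for GLM Air, about 3.5 bits per weight for the quant, 12GB of VRAM on the 3060, and a flat 8GB allowance for KV cache, OS, and runtime. The helper names `quant_size_gb` and `fits` are made up for illustration.

```python
def quant_size_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of quantized weights in GB (1 GB = 1e9 bytes)."""
    # params_billion * 1e9 weights * bits / 8 bits-per-byte / 1e9 bytes-per-GB
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         ram_gb: float, vram_gb: float, overhead_gb: float = 8.0) -> bool:
    """True if the weights plus a flat overhead allowance fit in RAM + VRAM.

    overhead_gb is a guessed allowance, not a measured figure.
    """
    needed = quant_size_gb(params_billion, bits_per_weight) + overhead_gb
    return needed <= ram_gb + vram_gb


if __name__ == "__main__":
    # Assumed figures: ~106B parameters, ~3.5 bits/weight, 48GB RAM, 12GB VRAM.
    print(quant_size_gb(106, 3.5))
    print(fits(106, 3.5, ram_gb=48, vram_gb=12))
    # The ~350B model mentioned above, on the same assumed hardware.
    print(fits(355, 3.5, ram_gb=48, vram_gb=12))
```

This only checks whether the weights fit in memory at all. It says nothing about speed, which in hybrid CPU+GPU inference depends heavily on which layers end up on the GPU.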

Depending on the use case, I’d recommend ERNIE 4.5 21B (or 28B for vision) on your MacBook, or a Qwen 30B variant. Look for DWQ MLX quants, specifically: https://huggingface.co/models?sort=modified&search=dwq
