Years of experience tell me I should generally avoid Apple’s first-generation products. The first-generation Apple Watch, the first-generation iPhone, etc. left a lot to be desired. I wouldn’t want to try the first-generation Apple modem in a daily-driver iPhone.
I’m using Ollama to try a couple of models right now for an idea. I’ve tried running Llama 3.2 and Qwen 2.5 3b, both of which fit in my 3050 6G’s VRAM. For fun, I’ve also tried Qwen 2.5 32b, which fits in my RAM (I’ve got 128G), but it could only generate a couple of tokens per second, making it very much a non-interactive experience. I’ll need to explore the response time piece a bit further to see if there are ways I can still lean on larger models despite the longer delays.
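
One way to put numbers on the response time piece is to read the timing fields Ollama returns with a non-streaming `/api/generate` call: `eval_count` (generated tokens) and `eval_duration` (nanoseconds). The sketch below is only a rough starting point. It assumes Ollama is listening on its default local port, and the model tags, the prompt, and the 300-token reply length are placeholders to swap for whatever is actually pulled and needed.

```python
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def tokens_per_second(response: dict) -> float:
    """Generation speed from the timing fields of a non-streaming response."""
    eval_count = response["eval_count"]          # tokens generated
    eval_duration_ns = response["eval_duration"]  # time spent generating, in ns
    if eval_duration_ns <= 0:
        return 0.0
    return eval_count / (eval_duration_ns / 1e9)


def estimated_wait_seconds(reply_tokens: int, tps: float) -> float:
    """Rough wait for a reply of the given length at a measured speed."""
    if tps <= 0:
        return float("inf")
    return reply_tokens / tps


def generate(model: str, prompt: str) -> dict:
    """Send one non-streaming generate request and return the parsed JSON."""
    payload = json.dumps(
        {"model": model, "prompt": prompt, "stream": False}
    ).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL,
        data=payload,
        headers={"Content-Type": "application/json"},
    )
    # Generous timeout, since a large model on CPU can take minutes.
    with urllib.request.urlopen(request, timeout=600) as resp:
        return json.loads(resp.read())


if __name__ == "__main__":
    # Placeholder model tags and prompt; adjust to the models actually pulled.
    for model in ("llama3.2", "qwen2.5:3b", "qwen2.5:32b"):
        result = generate(model, "Summarize why the sky is blue in two sentences.")
        tps = tokens_per_second(result)
        wait = estimated_wait_seconds(300, tps)
        print(f"{model}: {tps:.1f} tokens/s, ~{wait:.0f}s for a 300-token reply")
```

Turning tokens per second into an estimated wait for a typical reply makes it easier to judge whether a slow model is usable for work that doesn't need an immediate answer.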