If you want 100% accuracy from these kinds of tasks with LLMs you can get it today, but you need to provide the LLM with the ability to run Python code and tell it to use something like Pandas.
You can confirm it's doing the right thing by reviewing the code it wrote.
You can confirm it's doing the right thing by reviewing the code it wrote.