Understanding Agentic Function-Calling with Multi-Modal Data Access