Hello!
Writing
Notes on full-stack engineering, AI in production, and shipping reliable software, newest first.
Uncover the vulnerabilities and biases in current AI agent benchmarks and learn practical Python strategies to build more robust, secure, and trustworthy LLM evaluation frameworks.
Explore practical Python techniques for building robust Qwen3.6-Plus-powered LLM agents that interact reliably with tools and APIs and hold up under real-world deployment challenges.