US data bias in LLM training sets
Large language models are trained on data that over-represents US companies relative to Canadian brands of equivalent size and market position. When a buyer asks an AI about a Canadian company, the model often has less data to draw on, and what it has is of lower quality. As a result, its descriptions of Canadian companies tend to be thinner, less accurate, and more prone to fabrication than those it generates for comparable US brands.