The Voltaire Fabric Collective Accelerator (FCA) is a solution that offloads collective operations from CPUs to switches to accelerate performance and scalability. It uses CPUs in switches to perform reduction and messaging for collective operations. The FCA addresses network congestion through single-message transmissions per wire and shields collectives from node noise. It can reduce collective operation runtimes by up to 100x and enable linear scalability to thousands of nodes.