Gumbel Counterfactual Generation from Language Models