copy and paste this google map to your website or blog!
Press copy button and paste into your blog or website.
(Please switch to 'HTML' mode when posting into your blog. Examples: WordPress Example, Blogger Example)
CLEVER: A Curated Benchmark for Formally Verified Code Generation TL;DR: We introduce CLEVER, a hand-curated benchmark for verified code generation in Lean It requires full formal specs and proofs No few-shot method solves all stages, making it a strong testbed for synthesis and formal reasoning
Clever: A Curated Benchmark for Formally Verified Code Generation We introduce CLEVER, the first curated benchmark for evaluating the generation of specifications and formally verified code in Lean The benchmark comprises of 161 programming problems; it evaluates both formal speci-fication generation and implementation synthesis from natural language, requiring formal correctness proofs for both
Submissions | OpenReview Promoting openness in scientific communication and the peer-review process
Evaluating the Robustness of Neural Networks: An Extreme Value. . . Our analysis yields a novel robustness metric called CLEVER, which is short for Cross Lipschitz Extreme Value for nEtwork Robustness The proposed CLEVER score is attack-agnostic and is computationally feasible for large neural networks
STAIR: Improving Safety Alignment with Introspective Reasoning Ensuring the safety and harmlessness of Large Language Models (LLMs) has become equally critical as their performance in applications However, existing safety alignment methods typically suffer
Do Histopathological Foundation Models Eliminate Batch Effects? A . . . Keywords: histopathology, foundation models, batch effects, Clever Hans effect, robustness, generalization Abstract: Deep learning has led to remarkable advancements in computational histopathology, e g , in diagnostics, biomarker prediction, and outcome prognosis
La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse. . . We use a clever technique that involves rotating the data within each layer of the model, making it easier to identify and keep only the most important parts for processing This ensures that the model remains fast and efficient without losing much accuracy
Counterfactual Debiasing for Fact Verification 579 In this paper, we have proposed a novel counter- factual framework CLEVER for debiasing fact- checking models Unlike existing works, CLEVER is augmentation-free and mitigates biases on infer- ence stage In CLEVER, the claim-evidence fusion model and the claim-only model are independently trained to capture the corresponding information
Contrastive Learning Via Equivariant Representation - OpenReview In this paper, we revisit the roles of augmentation strategies and equivariance in improving CL's efficacy We propose CLeVER (Contrastive Learning Via Equivariant Representation), a novel equivariant contrastive learning framework compatible with augmentation strategies of arbitrary complexity for various mainstream CL backbone models