Welcome! aiXamine is a SaaS platform for evaluating LLMs and their data against various safety issues and security threats. Our goal is to provide individuals and organizations with a reliable service for examining the safety and security of their models before they are integrated into end-user applications. Below, you can find detailed information on how to use aiXamine and learn more about the supported examination services.
📄 Our paper is now available on arXiv!
How To
Search Examination Reports
Create an Examination
Compare Models in a Leaderboard
View an Examination Report
Manage User Account
Learn About
Adversarial Robustness
Code Security
Fairness & Bias
Hallucination
Jailbreaking
Model & Data Privacy
OOD Robustness
Over Refusal
Safety & Alignment
Disclaimers
- Datasets used in aiXamine contain dialogue that may be considered offensive or harmful.
- During examinations, we allow up to 1% of model responses to be empty. Scores are calculated only on the remaining valid responses. If more than 1% of responses are empty, the examination is considered failed.
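The empty-response rule above can be illustrated with a minimal Python sketch. This is not aiXamine's implementation: the function name `score_examination`, the `score_fn` callback, the use of a mean as the aggregate, and the treatment of `None` or whitespace-only strings as "empty" are all assumptions made for illustration.

```python
from typing import Callable, Optional, Sequence

MAX_EMPTY_PERCENT = 1  # threshold from the disclaimer: up to 1% empty responses


def score_examination(
    responses: Sequence[Optional[str]],
    score_fn: Callable[[str], float],  # hypothetical per-response scorer
) -> Optional[float]:
    """Return the mean score over valid responses, or None if the run fails."""
    if not responses:
        return None  # assumption: nothing to score counts as a failed run

    # Assumption: a response is "empty" if it is None or only whitespace.
    valid = [r for r in responses if r is not None and r.strip()]
    empty_count = len(responses) - len(valid)

    # Integer comparison avoids floating-point issues: fail if empty > 1%.
    if empty_count * 100 > MAX_EMPTY_PERCENT * len(responses):
        return None

    # Score only the remaining valid responses.
    return sum(score_fn(r) for r in valid) / len(valid)
```

Under these assumptions, a run of 200 responses with 2 empty ones (exactly 1%) is still scored on the 198 valid responses, while a run of 100 responses with 2 empty ones (2%) is treated as failed.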
References