| | --- |
| | license: apache-2.0 |
| | language: |
| | - en |
| | base_model: |
| | - Qwen/Qwen3-Embedding-8B |
| | --- |
| | ## LEXA-8B |
| |  |
| | 👉 **LEXA-8B**: LEXA: Legal Case Retrieval via Graph Contrastive Learning with Contextualised LLM Embeddings. More information is available in [**arXiv**](https://arxiv.org/abs/2405.11791) & [**GitHub**](https://github.com/yanran-tang/CaseGNN). |
| |
|
| | ## Example Usage |
| |
|
| | ```python |
| | from transformers import AutoModel, AutoTokenizer |
| | model = AutoModel.from_pretrained("AnnaStudy/LEXA-8B", torch_dtype="auto", device_map="auto") |
| | tokenizer = AutoTokenizer.from_pretrained("AnnaStudy/LEXA-8B") |
| | case_txt = "The following contains key components of a legal case. Legal facts..." |
| | tokenized = tokenizer(case_txt, return_tensors='pt', padding=True, truncation=True, max_length=2048) |
| | outputs = model(**tokenized) |
| | case_embedding = outputs.last_hidden_state[:, -1] |
| | ``` |
| | ## Base Model |
| |
|
| | ReaKase-8B is finetuned from **Qwen3-Embedding-8B**, which provides the underlying semantic representation capability. |
| |
|
| | Reference: [Qwen/Qwen3-Embedding-8B](https://huggingface.co/Qwen/Qwen3-Embedding-8B) |
| |
|
| | ## Cite |
| | If you find this repo useful, please cite |
| | ``` |
| | @article |
| | {LEXA-8B, |
| | author = {Yanran Tang, Ruihong Qiu, Xue Li, Zi Huang}, |
| | title = {LEXA: Legal Case Retrieval via Graph Contrastive Learning with Contextualised LLM Embeddings}, |
| | journal = {CoRR}, |
| | volume = {abs/2405.11791}, |
| | year = {2025} |
| | } |
| | ``` |