Skip to content

Commit c61c497

Browse files
committed
Merge remote-tracking branch 'origin/main'
2 parents bea879c + bb31526 commit c61c497

File tree

2 files changed

+5
-11
lines changed

2 files changed

+5
-11
lines changed

Dockerfile

Lines changed: 1 addition & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -1,28 +1,20 @@
1-
# Use Python 3.9 as base image
21
FROM python:3.9
32

43
WORKDIR /wsgi
54

6-
# Copy only dependency files first to leverage Docker caching
75
COPY Pipfile Pipfile.lock /wsgi/
86

9-
# Install Pipenv and dependencies
107
RUN pip install pipenv && pipenv install --deploy --ignore-pipfile
118

12-
# Install CPU-only PyTorch (avoids CUDA errors)
139
RUN pipenv run pip install torch --index-url https://download.pytorch.org/whl/cpu
1410

15-
# Download spaCy model inside the virtual environment
1611
RUN pipenv run python -m spacy download en_core_web_sm
1712

18-
# Copy the application files
1913
COPY . /wsgi
2014

21-
# Unset HF_TOKEN to prevent conflicts
15+
# To prevent conflicts with .env
2216
ENV HF_TOKEN=""
2317

24-
# Expose the port for Flask
2518
EXPOSE 3008
2619

27-
# Run Flask
2820
CMD ["pipenv", "run", "flask", "run", "--host=0.0.0.0", "--port=3008"]

README.md

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -3,12 +3,14 @@
33
---
44

55
## Introduction
6-
FeaClustRE (**Feature Clustering and Analysis Visualization Tool**) is an advanced tool designed to **analyze, cluster, and visualize structured data features** using machine learning techniques. It provides **hierarchical clustering, dendrogram visualizations, and feature-based insights** to help researchers and data scientists explore complex datasets.
6+
FeaClustRE (**Feature Clustering and Analysis Visualization Tool**) is an advanced tool designed to **analyze, cluster, and visualize structured hierarchical features** using NLP and LLM models and techniques. It provides **hierarchical clustering, dendrogram visualizations, and evaluations** to help to explore complex lists of features.
77

88
This tool uses **Meta's LLaMA model** for feature embedding and **Hugging Face's Transformers** for feature family clustering.
99

1010
With a flexible **backend API**, a **CLI client**, and **visualization tools**, FeaClustRE supports both **interactive analysis and automated batch processing**.
1111

12+
This tool is part of the RE-Miner Ecosystem, which can be explored in the [GESSI-NLP4SE repository](https://github.com/nlp4se).
13+
1214
### Key Features
1315
- **Custom Clustering Algorithm** – Uses a hand-made affinity-based clustering approach to automatically group similar features.
1416
- **Dendrogram Visualization** – Generates hierarchical visualizations to explore feature relationships.
@@ -48,7 +50,7 @@ This project uses **Meta's LLaMA model**, which is **gated** and requires **manu
4850
### **Using Your Hugging Face Token**
4951
To authenticate, you **must set your Hugging Face token** before running the project.
5052

51-
#### **1️⃣ Set the Token in `.env`**
53+
#### **Set the Token in `.env`**
5254
In the `.env` file in the project root, add:
5355

5456
```

0 commit comments

Comments
 (0)