A comprehensive implementation for fraud and scam detection using advanced Natural Language Processing techniques. This project leverages state-of-the-art transformer-based models for accurate fraud classification with explainable AI-powered reasoning. The latest BART Joint Model provides simultaneous classification and detailed contextual explanations in a single unified architecture.
GitHub Repository: https://github.com/RockENZO/NLP-Cyber-Harm-Detection.git
This project implements advanced models for detecting fraudulent content (scams, phishing, spam) in text data using state-of-the-art transformer architectures. Key approaches include:
We added a unified, text-to-text approach using FLAN-T5 that produces both the class label and a concise explanation in a single generation. This works directly with your existing CSV (final_fraud_detection_dataset.csv) that has text and detailed_category columns. If your dataset also has rationale fields such as explanation or rationale, the trainer will use them; otherwise, it auto-synthesizes short label-specific explanations.
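To make the target format concrete, here is an illustrative sketch (not the trainer's actual code) of how a text-to-text target could be synthesized when the CSV lacks rationale columns; the fallback reason strings are invented examples:

```python
# Illustrative only: synthesize "label: ... | reason: ..." targets from a CSV
# row with 'detailed_category', falling back to canned per-label reasons.
FALLBACK_REASONS = {
    'phishing': 'asks for login via suspicious link',
    'reward_scam': 'promises an unsolicited prize to lure the recipient',
    'legitimate': 'no fraud indicators present',
}

def build_target(row):
    # Prefer an existing rationale column; otherwise use a label-specific fallback
    reason = (row.get('explanation') or row.get('rationale')
              or FALLBACK_REASONS.get(row['detailed_category'],
                                      'matches known fraud patterns'))
    return f"label: {row['detailed_category']} | reason: {reason}"

row = {'text': 'Reset your password here', 'detailed_category': 'phishing'}
print(build_target(row))  # label: phishing | reason: asks for login via suspicious link
```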
Key files:
- training/unified_t5_fraud.py - fine-tunes FLAN-T5 on your dataset to generate outputs like: label: phishing | reason: asks for login via suspicious link
- demos/unified_reasoning_demo.py - loads a fine-tuned model and runs inference, parsing both label and reason.

Install/update dependencies: transformers and sentencepiece for FLAN-T5. sentencepiece is now added to requirements.txt.

Quick start (macOS, zsh):

1) Train the unified model (adjust args as needed), using final_fraud_detection_dataset.csv and google/flan-t5-small (fits CPU/MPS easily). Try flan-t5-base for higher quality.
2) After training, run the demo to classify text and see a concise reason.

Apple Silicon acceleration: the trainer auto-selects CUDA, MPS, or CPU. You can force MPS with --device mps.
Notes:
The demo prints a rough confidence proxy based on token-level probabilities. This is not a calibrated class probability.
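One common way to compute such a proxy (a sketch, not necessarily the demo's exact formula) is the geometric mean of the generated tokens' probabilities:

```python
import math

def confidence_proxy(token_probs):
    # Geometric mean of the per-token probabilities of the generated tokens.
    # A rough indicator of generation confidence; NOT a calibrated probability.
    return math.exp(sum(math.log(p) for p in token_probs) / len(token_probs))

print(round(confidence_proxy([0.9, 0.9, 0.9]), 3))  # 0.9
```

A single low-probability token drags the score down sharply, which is why the proxy should not be read as a class probability.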
NEWEST: The project now features a state-of-the-art BART joint model (unified-bart-joint-enhanced) that combines classification and contextual reasoning in a single unified architecture:
BartForJointClassificationAndGeneration with dual heads:
- training/unified-bart-joint-enhanced-reasoning.ipynb - Training pipeline for joint model
- demos/test-unified-bart-joint-enhanced.ipynb - Comprehensive testing and evaluation notebook
- models/unified-bart-joint-enhanced/ - Trained model with multiple checkpoints

Example:

Input: "Congratulations! You've won a $1000 gift card. Click here to claim now!"
Predicted: reward_scam
Confidence: 0.987 (98.7%)
Enhanced Reasoning: The message contains typical reward scam indicators including
unsolicited prize announcement, urgency ("claim now"), request for immediate action
via suspicious link, and promises of high-value rewards without prior participation.
These are classic tactics used to lure victims into providing personal information
or making fraudulent payments.
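The dual-head idea can be sketched as follows. This is an illustration, not the repository's BartForJointClassificationAndGeneration implementation: a BART backbone whose encoder states are mean-pooled into a classification head, while the decoder keeps the usual generation head.

```python
import torch
import torch.nn as nn
from transformers import BartConfig, BartForConditionalGeneration

class BartJointSketch(BartForConditionalGeneration):
    """Toy dual-head BART: generation head plus an encoder-pooled classifier."""

    def __init__(self, config, num_labels=9):
        super().__init__(config)  # keeps the standard lm_head for generation
        self.classifier = nn.Linear(config.d_model, num_labels)

    def classify(self, input_ids, attention_mask):
        enc = self.model.encoder(input_ids=input_ids,
                                 attention_mask=attention_mask,
                                 return_dict=True)
        # Mean-pool encoder hidden states over non-padding tokens
        mask = attention_mask.unsqueeze(-1).float()
        pooled = (enc.last_hidden_state * mask).sum(1) / mask.sum(1)
        return self.classifier(pooled)

# Tiny random-weight config just to demonstrate the shapes
cfg = BartConfig(vocab_size=128, d_model=16, encoder_layers=1, decoder_layers=1,
                 encoder_attention_heads=2, decoder_attention_heads=2,
                 encoder_ffn_dim=32, decoder_ffn_dim=32)
model = BartJointSketch(cfg, num_labels=9)
ids = torch.randint(0, 128, (2, 8))
mask = torch.ones(2, 8, dtype=torch.long)
print(model.classify(ids, mask).shape)  # torch.Size([2, 9])
```

Because the classifier reads only the encoder, classification costs one encoder pass, while the explanation still runs the full encoder-decoder generation.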
The notebook demos/test-unified-bart-joint-enhanced.ipynb is fully Kaggle-compatible:
- Upload the unified-bart-joint-enhanced folder as a Kaggle Dataset

NEW: The project now includes a production-ready DistilBERT model with significant advantages:
The DistilBERT model is trained for multiclass classification, providing granular fraud type detection rather than just binary fraud/legitimate classification.
├── README.md                            # This comprehensive documentation
├── requirements.txt                     # Python dependencies
├── final_fraud_detection_dataset.csv    # Training dataset (Git LFS)
├── models/                              # Saved trained models
│   ├── bert_model/                      # Trained BERT model files
│   ├── bert_tokenizer/                  # BERT tokenizer files
│   ├── distilbert_model/                # Trained DistilBERT model files
│   ├── distilbert_tokenizer/            # DistilBERT tokenizer files
│   ├── unified-bart-joint-enhanced/     # BART joint classification + reasoning model
│   │   ├── config.json                  # Model configuration
│   │   ├── model.safetensors            # Trained model weights
│   │   ├── generation_config.json       # Generation parameters
│   │   ├── tokenizer files              # BART tokenizer
│   │   └── checkpoint-*/                # Training checkpoints
│   ├── flan-t5-base/                    # FLAN-T5 model files
│   └── unified-flan-t5-small/           # Unified FLAN-T5 model
├── training/                            # Training scripts and notebooks
│   ├── baseline_fraud_detection.py      # Traditional ML baseline models
│   ├── bert_fraud_detection.py          # BERT-based classifier
│   ├── fraud_detection_baseline.ipynb   # Interactive Jupyter notebook
│   ├── kaggle_fraud_detection.ipynb     # Kaggle-optimized training notebook
│   ├── unified-bart-joint-enhanced-reasoning.ipynb  # BART joint model training
│   ├── unified_t5_fraud_kaggle.ipynb    # FLAN-T5 training notebook
│   └── unified_t5_fraud.py              # FLAN-T5 training script
├── demos/                               # Demo and testing tools
│   ├── fraud_detection_demo.py          # Full-featured demo script
│   ├── fraud_detection_demo.ipynb       # Interactive demo notebook
│   ├── quick_demo.py                    # Quick verification script
│   ├── test-unified-bart-joint-enhanced.ipynb  # BART joint model testing (Kaggle-ready)
│   ├── unified_flan_t5_demo.ipynb       # FLAN-T5 demo notebook
│   └── unified_reasoning_demo.py        # Unified reasoning demo script
├── reasoning/                           # AI-powered reasoning pipeline
│   ├── GPT2_Fraud_Reasoning.ipynb       # GPT2-based reasoning analysis
│   └── KaggleLLMsReasoning.ipynb        # Local reasoning notebook
├── docs/                                # Documentation
│   └── nlp_terms_explanation.md         # NLP concepts explanation
├── runs/                                # Training run outputs and analysis results
│   ├── fraud_analysis_results_20250916_155231.csv
│   ├── fraud-detection-kaggle-training-bert-run.ipynb
│   ├── gpt2_fraud_analysis_20250917_034015.csv
│   ├── LLMsReasoningResultVisualization.ipynb
│   ├── MultipleLLMsReasoning(small-models).ipynb
│   └── LLMsStats/                       # LLM performance comparison charts
│       ├── llm_category_heatmap.png
│       ├── llm_comparison_table.csv
│       ├── llm_performance_comparison.png
│       ├── llm_quality_radar.png
│       ├── llm_size_performance.png
│       ├── llm_speed_quality_scatter.png
│       ├── llm_model_size_comparison.png  # Model size vs performance charts
│       └── llm_speed_quality_bubble.png   # Speed vs quality bubble chart
├── .gitattributes                       # Git LFS configuration
├── .gitignore                           # Git ignore rules
└── .git/                                # Git repository
Test the enhanced BART model with classification + contextual reasoning:
pip install torch transformers pandas numpy matplotlib seaborn jupyter
# Open the comprehensive testing notebook
jupyter notebook demos/test-unified-bart-joint-enhanced.ipynb
The notebook includes:
- Upload models/unified-bart-joint-enhanced/ as a Kaggle Dataset
- Upload demos/test-unified-bart-joint-enhanced.ipynb to Kaggle

If you have already trained BERT/DistilBERT models:
pip install torch transformers pandas numpy matplotlib seaborn jupyter
python demos/quick_demo.py
jupyter notebook demos/fraud_detection_demo.ipynb
python demos/fraud_detection_demo.py
# Upload KaggleGPTReasoning.ipynb to Kaggle
# Enable GPU accelerator
# Run all cells for fraud detection + AI explanations
# Download results - no API costs
📊 LLM Performance Analysis: Check runs/LLMsStats/ for performance comparisons.
pip install -r requirements.txt
python training/baseline_fraud_detection.py
python training/bert_fraud_detection.py
- Upload final_fraud_detection_dataset.csv to Kaggle
- Run training/fraud-detection-kaggle-training-bert-run.ipynb

Note: The dataset is stored with Git LFS due to its size (~158MB). Clone with git lfs pull to download the full dataset. Large model files like model.zip are excluded from git to keep the repository size manageable.
The runs/LLMsStats/ directory contains LLM model analysis for fraud reasoning tasks.
- Traditional ML baseline models (training/baseline_fraud_detection.py)
- BERT-based classifier (training/bert_fraud_detection.py)
- Kaggle-optimized training notebook (training/kaggle_fraud_detection.ipynb)
- BART joint model training (training/unified-bart-joint-enhanced-reasoning.ipynb)
- BartForJointClassificationAndGeneration - Custom dual-head model
- BART joint model testing (demos/test-unified-bart-joint-enhanced.ipynb)
- FLAN-T5 unified training (training/unified_t5_fraud.py), generating outputs like: label: phishing | reason: asks for login via suspicious link
- Kaggle BERT training (runs/fraud-detection-kaggle-training-bert-run.ipynb)
- AI-powered reasoning pipeline (reasoning/)
- reasoning/KaggleGPTReasoning.ipynb for local reasoning analysis

Once you have a trained model, use these tools to test and demonstrate fraud detection capabilities:
- demos/fraud_detection_demo.ipynb
- demos/fraud_detection_demo.py
- demos/quick_demo.py

Your trained model can detect these 9 classes:
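For reference, the label set and an id mapping can be written down as follows (a sketch: the id order here is alphabetical and illustrative, while the trained model's config.json defines the actual mapping):

```python
# The 9 classes (8 fraud types + legitimate) the demos classify into.
CLASSES = [
    'job_scam', 'legitimate', 'phishing', 'popup_scam', 'refund_scam',
    'reward_scam', 'sms_spam', 'ssn_scam', 'tech_support_scam',
]
ID2LABEL = dict(enumerate(CLASSES))
LABEL2ID = {label: i for i, label in ID2LABEL.items()}
print(len(CLASSES))  # 9
```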
# Load the enhanced BART joint model
# Note: BartForJointClassificationAndGeneration and ID2LABEL are defined in
# training/unified-bart-joint-enhanced-reasoning.ipynb
from pathlib import Path
from transformers import AutoTokenizer, AutoConfig
import torch

# Load model and tokenizer
MODEL_DIR = Path('models/unified-bart-joint-enhanced')
tokenizer = AutoTokenizer.from_pretrained(str(MODEL_DIR))
config = AutoConfig.from_pretrained(str(MODEL_DIR))
joint_model = BartForJointClassificationAndGeneration.from_pretrained(str(MODEL_DIR), config=config)
joint_model.eval()

# Single prediction with reasoning
def predict_with_reasoning(text):
    instruction = (
        'Analyze this message and classify it into one of these categories: '
        'job_scam, legitimate, phishing, popup_scam, refund_scam, reward_scam, '
        'sms_spam, ssn_scam, tech_support_scam. '
        'Then explain your reasoning by identifying specific suspicious elements.\n\n'
        f'Message: {text}'
    )
    inputs = tokenizer([instruction], return_tensors='pt', truncation=True, max_length=256)
    with torch.no_grad():
        # Get classification from the encoder-pooled head
        enc_out = joint_model.model.encoder(**inputs, return_dict=True)
        pooled = joint_model.pooled_encoder(enc_out.last_hidden_state, inputs['attention_mask'])
        cls_logits = joint_model.classifier(pooled)
        pred_label = ID2LABEL[cls_logits.argmax(-1).item()]
        confidence = torch.softmax(cls_logits, dim=-1).max().item()
        # Generate reasoning with the seq2seq head
        gen_ids = joint_model.generate(
            **inputs,
            max_new_tokens=128,
            num_beams=5,
            length_penalty=1.0,
            no_repeat_ngram_size=3
        )
    reasoning = tokenizer.decode(gen_ids[0], skip_special_tokens=True)
    return {
        'label': pred_label,
        'confidence': confidence,
        'reasoning': reasoning
    }
# Example usage
result = predict_with_reasoning("Congratulations! You've won $1000. Click to claim now!")
print(f"🎯 Predicted: {result['label']}")
print(f"📊 Confidence: {result['confidence']:.3f}")
print(f"💡 Reasoning: {result['reasoning']}")
Output Example:
🎯 Predicted: reward_scam
📊 Confidence: 0.987
💡 Reasoning: The message contains typical reward scam indicators including unsolicited
prize announcement, urgency ("claim now"), request for immediate action via suspicious
link, and promises of high-value rewards without prior participation. These are classic
tactics used to lure victims into providing personal information or making fraudulent payments.
from demos.fraud_detection_demo import FraudDetectionDemo
demo = FraudDetectionDemo()
result = demo.predict_single("Your account has been compromised! Click here now!")
print(f"Prediction: {result['predicted_class']}")
print(f"Confidence: {result['confidence']:.4f}")
print(f"Is Fraud: {result['is_fraud']}")
texts = [
    "Meeting at 3 PM tomorrow",
    "URGENT: Verify your SSN now!",
    "You won $10,000! Send fee to claim"
]
results = demo.predict_batch(texts)
for result in results:
    print(f"{result['predicted_class']}: {result['text']}")
# In demos/fraud_detection_demo.ipynb
your_text = "Your Netflix subscription has expired. Update your payment method to continue watching."
result = predict_fraud(your_text)
display_prediction(result)
See demos/test-unified-bart-joint-enhanced.ipynb for comprehensive testing including:
Based on training results and baseline implementations:
| Model Type | Expected Accuracy | F1-Score | Reasoning | Notes |
|---|---|---|---|---|
| Simple Rule-Based | 60-70% | 0.6-0.7 | ❌ None | Quick prototype |
| TF-IDF + LogReg | 80-90% | 0.8-0.9 | ❌ None | Good baseline |
| TF-IDF + SVM | 80-90% | 0.8-0.9 | ❌ None | Robust to noise |
| BERT Fine-tuned | 90-95% | 0.9-0.95 | ❌ None | Best classification only |
| DistilBERT Fine-tuned | 89-94% | 0.89-0.94 | ❌ None | 60% faster, 97% of BERT |
| FLAN-T5 Unified | 85-90% | 0.85-0.90 | ✅ Compact | Text-to-text with short reasons |
| BART Joint Enhanced | 91-93% | 0.91-0.93 | ✅ Rich Contextual | Best overall: classification + detailed explanations ⭐ |
BART Joint Enhanced stands out as the most comprehensive solution:
To run the demos, make sure that:

- models/bert_model/ and models/bert_tokenizer/ exist (for BERT)
- models/distilbert_model/ and models/distilbert_tokenizer/ exist (for DistilBERT)
- models/unified-bart-joint-enhanced/ exists (for BART joint model)
- Dependencies are installed: pip install torch transformers pandas numpy matplotlib seaborn

Tips:

- Use predict_batch() for scoring multiple texts at once
- Use the FraudDetectionDemo class as a starting point for applications

# In training/baseline_fraud_detection.py
vectorizer = TfidfVectorizer(
    max_features=5000,      # Vocabulary size
    stop_words='english',   # Remove common words
    ngram_range=(1, 2)      # Use unigrams and bigrams
)
# In training/bert_fraud_detection.py
classifier = BERTFraudClassifier(
    model_name='bert-base-uncased',  # Or 'distilbert-base-uncased' for faster training
    max_length=128,                  # Maximum sequence length
    num_classes=2                    # Binary classification
)
# In training/kaggle_fraud_detection.ipynb
model = DistilBertForSequenceClassification.from_pretrained(
    'distilbert-base-uncased',
    num_labels=10  # Multiclass classification (9 fraud types + legitimate)
)
batch_size = 16 # Can use larger batches due to lower memory usage
max_length = 128 # Maximum sequence length
epochs = 3 # Faster training allows more epochs
learning_rate = 2e-5 # DistilBERT learning rate
# In runs/fraud-detection-kaggle-training-bert-run.ipynb
batch_size = 16 # Adjust based on GPU memory
max_length = 128 # Maximum sequence length
epochs = 3 # Training epochs
learning_rate = 2e-5 # BERT learning rate
LOGISTIC_REGRESSION:
Accuracy: 0.889
F1-Score: 0.889
AUC Score: 0.944
SVM:
Accuracy: 0.889
F1-Score: 0.889
AUC Score: 0.944
BERT Evaluation Results:
precision recall f1-score support
normal 0.92 0.92 0.92 38
fraud 0.92 0.92 0.92 37
accuracy 0.92 75
macro avg 0.92 0.92 0.92 75
weighted avg 0.92 0.92 0.92 75
DistilBERT Multiclass Evaluation Results:
precision recall f1-score support
job_scam 0.89 0.94 0.91 32
legitimate 0.95 0.91 0.93 45
phishing 0.92 0.90 0.91 41
popup_scam 0.88 0.92 0.90 38
refund_scam 0.91 0.88 0.89 34
reward_scam 0.90 0.93 0.91 36
sms_spam 0.93 0.89 0.91 43
ssn_scam 0.87 0.91 0.89 35
tech_support_scam 0.94 0.89 0.91 37
accuracy 0.91 341
macro avg 0.91 0.91 0.91 341
weighted avg 0.91 0.91 0.91 341
📊 DistilBERT Overall Metrics:
Accuracy: 0.9120
F1-Score (Macro): 0.9088
F1-Score (Weighted): 0.9115
data = pd.DataFrame({
    'message': ['text content here', ...],
    'label': ['fraud', 'normal', ...]
})
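Given that format, a typical first step is encoding the string labels as integer ids (a minimal sketch with hypothetical rows):

```python
import pandas as pd

# Toy frame in the expected format; rows are invented examples
df = pd.DataFrame({
    'message': ['URGENT: verify your account', 'Lunch at noon?'],
    'label': ['fraud', 'normal'],
})
# Deterministic label-to-id mapping (sorted for reproducibility)
label2id = {lab: i for i, lab in enumerate(sorted(df['label'].unique()))}
df['label_id'] = df['label'].map(label2id)
print(label2id)  # {'fraud': 0, 'normal': 1}
```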
# Add sentiment analysis
from textblob import TextBlob
def add_sentiment_features(text):
    blob = TextBlob(text)
    return {
        'polarity': blob.sentiment.polarity,
        'subjectivity': blob.sentiment.subjectivity
    }
import re

def custom_preprocess(text):
    # Domain-specific preprocessing: mask URLs and currency amounts
    text = re.sub(r'https?://\S+', ' <URL> ', text)        # remove URLs
    text = re.sub(r'[$€£]\s?\d[\d,.]*', ' <MONEY> ', text)  # normalize currency
    return text
# Combine multiple models (assumes lr_model and svm_model are sklearn
# Pipelines that include the TF-IDF vectorizer, so they accept raw text)
def ensemble_predict(text):
    lr_pred = lr_model.predict_proba([text])[0][1]
    svm_pred = svm_model.predict_proba([text])[0][1]
    bert_pred = bert_model.predict(text)['probabilities']['fraud']
    # Weighted average of the three fraud probabilities
    final_score = 0.3 * lr_pred + 0.3 * svm_pred + 0.4 * bert_pred
    return 'fraud' if final_score > 0.5 else 'normal'
# Production-ready integration using the demo class
from demos.fraud_detection_demo import FraudDetectionDemo
detector = FraudDetectionDemo()
# Single prediction
result = detector.predict_single(user_message)
if result['is_fraud'] and result['confidence'] > 0.8:
    alert_user(result['predicted_class'])
from flask import Flask, request, jsonify
from demos.fraud_detection_demo import FraudDetectionDemo
app = Flask(__name__)
detector = FraudDetectionDemo()
@app.route('/predict', methods=['POST'])
def predict():
    text = request.json['text']
    result = detector.predict_single(text)
    return jsonify({
        'prediction': result['predicted_class'],
        'is_fraud': result['is_fraud'],
        'confidence': result['confidence']
    })
import streamlit as st
from demos.fraud_detection_demo import FraudDetectionDemo
st.title("🛡️ Fraud Detection System")
detector = FraudDetectionDemo()

text_input = st.text_area("Enter message to analyze:")
if st.button("Analyze"):
    result = detector.predict_single(text_input)
    if result['is_fraud']:
        st.error(f"🚨 FRAUD DETECTED: {result['predicted_class']}")
    else:
        st.success("✅ Message appears legitimate")
    st.write(f"Confidence: {result['confidence']:.2%}")
    # Show probability distribution
    st.bar_chart(result['all_probabilities'])
The models are evaluated using:
For fraud detection, Recall is often most important (don't miss actual fraud).
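To make that concrete, here is a toy pure-Python computation showing how a single missed fraud hurts recall even when precision stays perfect:

```python
def precision_recall(y_true, y_pred, positive='fraud'):
    # Count true positives, false positives, false negatives for one class
    tp = sum(t == positive and p == positive for t, p in zip(y_true, y_pred))
    fp = sum(t != positive and p == positive for t, p in zip(y_true, y_pred))
    fn = sum(t == positive and p != positive for t, p in zip(y_true, y_pred))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall

y_true = ['fraud', 'fraud', 'normal', 'fraud', 'normal']
y_pred = ['fraud', 'normal', 'normal', 'fraud', 'normal']
# One real fraud was missed (a false negative): precision stays 1.0
# but recall drops to 2/3
print(precision_recall(y_true, y_pred))
```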
Based on analysis of existing projects including:
- training/baseline_fraud_detection.py - Traditional ML models
- training/bert_fraud_detection.py - BERT training script
- runs/fraud-detection-kaggle-training-bert-run.ipynb - Kaggle BERT training notebook
- demos/fraud_detection_demo.ipynb - Interactive demo notebook
- demos/fraud_detection_demo.py - Full demo script
- demos/quick_demo.py - Quick verification
- models/bert_model/ - Trained BERT model
- models/bert_tokenizer/ - BERT tokenizer
- models/distilbert_model/ - Trained DistilBERT model (60% faster)
- models/distilbert_tokenizer/ - DistilBERT tokenizer
- models/unified-bart-joint-enhanced/ - BART joint model (classification + reasoning) ⭐ LATEST
- models/unified-flan-t5-small/ - FLAN-T5 unified model
- models/flan-t5-base/ - FLAN-T5 base model
- models/model.zip - Compressed model bundle (excluded from git)

# Test BART joint model (LATEST)
jupyter notebook demos/test-unified-bart-joint-enhanced.ipynb
# Quick test traditional models
python demos/quick_demo.py
# Interactive demo
jupyter notebook demos/fraud_detection_demo.ipynb
# Full demo
python demos/fraud_detection_demo.py
# Train from scratch
python training/baseline_fraud_detection.py
- BartForJointClassificationAndGeneration custom architecture
- Comprehensive testing notebook (demos/test-unified-bart-joint-enhanced.ipynb)

Note: This is a baseline implementation. For production use, consider: