📦 Data Source


1 🛒 Dataset Overview

The dataset used in this project comes from the UCI Machine Learning Repository and includes data from 12,330 individual shopping sessions.

Each session represents a unique user over the span of a one-year period. This design avoids biases caused by holidays, special events, or marketing campaigns.


3 🧠 Why This Dataset?

  • Based on real user behavior
  • Includes key features like:
    • Time spent on pages
    • Product views
    • Bounce and exit rates
    • Visitor type and session details
  • Useful for modeling purchase prediction

4 👩‍🏫 Dataset Authors

  • C. Okan Sakar
    Department of Computer Engineering
    Bahcesehir University, Istanbul, Turkey

  • Yomi Kastro
    Inveon Information Technologies
    Istanbul, Turkey


5 📚 Citation

If you use this dataset, please cite:

Sakar, C.O., Polat, S.O., Katircioglu, M. et al.
Neural Computing and Applications (2018).
View Paper

Also:

Dua, D. and Graff, C. (2019)
UCI Machine Learning Repository
Irvine, CA: University of California, School of Information and Computer Science.