A Shein dataset sample of over 1000 products. Dataset was extracted using the Bright Data API.
product_name
: The name or title of the productdescription
: A textual description of the productinitial_price
: The original or starting price of the productfinal_price
: The current or final price of the product after any discounts or promotionscurrency
: The currency in which the prices are listedin_stock
: Indicates whether the product is currently in stock (True/False)color
: The color or colors available for the productsize
: The size or sizes available for the productreviews_count
: The number of reviews or ratings given by customers for the productmain_image
: The main image representing the productcategory_url
: The URL or link associated with the category of the producturl
: The URL or link to the product pagecategory_tree
: The hierarchical tree structure of categories to which the product belongscountry_code
: The country code indicating the country of sale or origindomain
: The domain or website where the product is listedimage_count
: The total number of images associated with the productimage_urls
: URLs pointing to images related to the productmodel_number
: The model number (SKU) associated with the productoffers
: Information about any special offers or deals associated with the productother_attributes
: Additional attributes or features of the productproduct_id
: A unique identifier or code associated with the productrating
: The average rating given by customers for the productrelated_products
: Information about other products related to the current productroot_category
: The root or top-level category to which the product belongstop_reviews
: Top or featured reviews for the productcategory
: The specific category to which the product belongsbrand
: The brand or brand name associated with the productall_available_sizes
: A list of all available sizes for each product
And a lot more.
This is a sample subset which is derived from the "Shein Products (public data)" dataset which includes more than 32,800,000 products.
Available dataset file formats: JSON, NDJSON, JSON Lines, CSV, or Parquet. Optionally, files can be compressed to .gz.
Dataset delivery type options: Email, API download, Webhook, Amazon S3, Google Cloud storage, Google Cloud PubSub, Microsoft Azure, Snowflake, SFTP.
Update frequency: Once, Daily, Weekly, Monthly, Quarterly, or Custom basis.
Data enrichment available as an addition to the data points extracted: Based on request.
Delve into product reviews and ratings to gauge consumer opinions and ensure your offerings align with market expectations. Use the Shein dataset to comprehend customer sentiment toward specific products or your brand as a whole, helping you refine your commercial strategies. Spot inventory gaps, detect rising demand for particular products, and identify consumer trends. The Shein dataset empowers companies to make strategic decisions in inventory management, optimize stocking levels, and streamline the supply chain for greater efficiency. Craft a robust pricing strategy by identifying similar products and categories within your competitors’ offerings. Leverage the Shein dataset to determine optimal pricing, uncover pricing gaps, and implement dynamic pricing models based on real-time market data.The Bright Initiative offers access to Bright Data's Web Scraper APIs and ready-to-use datasets to leading academic faculties and researchers, NGOs and NPOs promoting various environmental and social causes. You can submit an application here.