How to Protect Your Website From Bot Attacks

Bhanu Reddy
Feb 25, 2024 · 5 min read

Introduction :

In today’s digital age, where the heartbeat of businesses resonates online, the threat landscape has evolved to include sophisticated and automated adversaries. Among these, bot attacks stand out as a formidable challenge, capable of wreaking havoc on companies of all sizes and industries.

As businesses increasingly rely on web applications, APIs, and online services, the risk of falling victim to malicious bots has become a pressing concern. Whether it’s the insidious spread of misinformation, the relentless onslaught of spam, or the orchestrated attempts to compromise sensitive data, bot attacks pose a real and pervasive threat.

🌐💻🛡️Let’s make our website a no-go zone for malicious bots! 🌐💻🛡️

What are Bots?

Bots, short for robots, are software applications that perform automated online tasks. They can be programmed to perform various functions, ranging from simple and repetitive tasks to more complex actions. Bots can be beneficial or malicious, depending on their intended purpose.

Good Bots: Good bots are designed to perform helpful and constructive tasks. They serve a variety of positive purposes across different domains, and they typically identify themselves and honor the crawl rules a site publishes in its robots.txt file (see the sample after this list). Here are some examples:

  1. Search Engine Crawlers: Search engines like Google, Bing, and others use bots to crawl and index web pages. These bots help in organizing and presenting information to users when they perform online searches.
  2. Chatbots: Chatbots are automated programs that engage in conversation with users, providing information, assistance, or performing specific tasks. They are commonly used in customer support, virtual assistants, and online messaging platforms.
  3. Web Scraping Bots for Data Collection: Some bots are designed for web scraping, collecting data from websites for legitimate purposes such as market research, price comparison, or aggregating information.
  4. Monitoring Bots: Bots can monitor websites for changes, updates, or availability. This is beneficial for tracking competitive intelligence or ensuring the uptime of web services.
  5. Social Media Bots for Engagement: Social media platforms use bots to facilitate engagement, such as liking posts, sharing content, and following users. These bots can enhance user experience and interaction.
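
A minimal robots.txt sketch shows how a site communicates with these well-behaved bots; the paths below are only illustrative.

# robots.txt — crawl directives that well-behaved bots respect
User-agent: *
Disallow: /admin/
Disallow: /payment/
Allow: /

Sitemap: https://yourwebsite.com/sitemap.xml

Keep in mind that robots.txt is purely advisory: malicious bots simply ignore it, which is why the WAF controls later in this article matter.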

Bad Bots: Bad bots are designed maliciously, often causing harm or disruption. They can be used for various nefarious purposes, including cyber attacks, fraud, and information theft. Here are examples of bad bots:

  1. Web Scraping Bots for Content Theft: Bots may scrape websites to steal content, which can be used for unauthorized reproduction, plagiarism, or to create fake websites.
  2. Credential Stuffing Bots: Bots can be employed to automate credential stuffing attacks, where stolen username and password combinations are systematically tested on various websites to gain unauthorized access.
  3. Distributed Denial of Service (DDoS) Bots: DDoS bots flood a website or online service with traffic, overwhelming its resources and causing it to become slow or unavailable.
  4. Spam Bots: Bots can be programmed to flood online forums, comment sections, or social media with spam messages, links, or advertisements.
  5. Click Fraud Bots: Bots engage in click fraud by repeatedly clicking on online ads to exhaust the advertiser’s budget or promote fraudulent activities.
  6. Impersonation Bots: Bots may impersonate legitimate users on social media or other platforms to spread misinformation, manipulate opinions, or engage in cyberbullying.

[Figure: Application topology]

Bot Control Managed Rule Group :

The managed rule group provides two protection levels:

  1. Common: Detects a variety of self-identifying bots and adds labels to them, categorizing and verifying simple bots (e.g., the bot:category:social_media label for bots such as Redditbot). It is ideal for customers with generic bot problems.
  2. Targeted: Adds detection based on client-side JavaScript interrogation, browser fingerprinting, CAPTCHA, and dynamic rate limiting. It protects against advanced bots that target specific applications by mimicking human traffic and changing attack vectors to evade detection. It is ideal for customers who are being targeted by advanced bots (an example rule configuration follows this list).
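
To make this concrete, here is roughly what a Bot Control rule looks like in web ACL JSON; switching InspectionLevel from COMMON to TARGETED enables the advanced protections. The rule name, priority, and metric name below are placeholders.

{
  "Name": "AWS-BotControl",
  "Priority": 0,
  "Statement": {
    "ManagedRuleGroupStatement": {
      "VendorName": "AWS",
      "Name": "AWSManagedRulesBotControlRuleSet",
      "ManagedRuleGroupConfigs": [
        { "AWSManagedRulesBotControlRuleSet": { "InspectionLevel": "COMMON" } }
      ]
    }
  },
  "OverrideAction": { "None": {} },
  "VisibilityConfig": {
    "SampledRequestsEnabled": true,
    "CloudWatchMetricsEnabled": true,
    "MetricName": "AWS-BotControl"
  }
}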

SCENARIO A: Block both self-identifying and disguised bots.

  1. Create a web ACL and associate your ALB or CloudFront distribution with it (a CLI sketch of steps 1–3 follows the test commands below).
  2. Add the AWS managed rule group with the Common protection level of Bot Control.
  3. Keep the rule group in Block mode (its default rule actions) so that identified bots are restricted.
  4. Simulate an attack by initiating a fake session from curl with the spoofed User-Agent “Amazonbot”.
  5. Simulate an attack by initiating a fake session from curl with the spoofed User-Agent “Mozilla/5.0” (a regular browser identity).
  6. Verify the sampled requests in the WAF console for the BotControlRuleSet rule group.
  7. WAF should detect the fake sessions and block the bot requests.
# Spoof a self-identifying bot User-Agent (run this in a loop and the sort | uniq -c pipe tallies the status codes)
curl -vkLso /dev/null -A "Amazonbot" -w "%{http_code}\n" https://yourwebsite.com | sort | uniq -c
# Spoof a regular browser User-Agent
curl -vkLso /dev/null -A "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:109.0) Gecko/20100101 Firefox/115.0" -w "%{http_code}\n" https://yourwebsite.com
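
A rough CLI sketch of steps 1–3, assuming the Bot Control rule JSON above is saved as a one-element array in a file called bot-control-rules.json; all names, IDs, and ARNs below are placeholders.

# Create the web ACL with the Bot Control rule group
# (scope REGIONAL for an ALB; use CLOUDFRONT and region us-east-1 for CloudFront)
aws wafv2 create-web-acl \
  --name bot-protection-acl \
  --scope REGIONAL \
  --default-action Allow={} \
  --rules file://bot-control-rules.json \
  --visibility-config SampledRequestsEnabled=true,CloudWatchMetricsEnabled=true,MetricName=bot-protection-acl

# Associate the web ACL with the ALB
# (for CloudFront, set the web ACL on the distribution itself instead)
aws wafv2 associate-web-acl \
  --web-acl-arn arn:aws:wafv2:us-east-1:123456789012:regional/webacl/bot-protection-acl/EXAMPLE-ID \
  --resource-arn arn:aws:elasticloadbalancing:us-east-1:123456789012:loadbalancer/app/my-alb/EXAMPLE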

SCENARIO B: CAPTCHA challenge for specific sensitive web pages.

  1. Use the existing web ACL created in Scenario A.
  2. Add a custom rule that enforces CAPTCHA verification (a sample rule follows this scenario).
  3. Scope it to specific sensitive pages such as /payment.
  4. Browse to https://yourwebsite.com/payment in a browser.
  5. WAF should present a CAPTCHA puzzle to verify that you are human.
  6. Attack the /payment page by initiating a fake session with curl.
  7. WAF should identify the request as a bot and block it.
# Request the protected page from curl without solving the CAPTCHA; the request should be challenged or blocked
curl -vkLso /dev/null -A "Mozilla/5.0 (Macintosh; Intel Mac OS X 10.15; rv:109.0) Gecko/20100101 Firefox/115.0" -w "%{http_code}\n" https://yourwebsite.com/payment
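
Step 2’s custom rule could look roughly like the following, as it would appear in the console’s JSON rule editor; the name, priority, and path are illustrative. The ByteMatchStatement scopes the rule to URI paths starting with /payment, and the Captcha action challenges matching requests.

{
  "Name": "captcha-payment-page",
  "Priority": 1,
  "Statement": {
    "ByteMatchStatement": {
      "SearchString": "/payment",
      "FieldToMatch": { "UriPath": {} },
      "TextTransformations": [ { "Priority": 0, "Type": "LOWERCASE" } ],
      "PositionalConstraint": "STARTS_WITH"
    }
  },
  "Action": { "Captcha": {} },
  "VisibilityConfig": {
    "SampledRequestsEnabled": true,
    "CloudWatchMetricsEnabled": true,
    "MetricName": "captcha-payment-page"
  }
}

A real browser gets the interactive puzzle, while a scripted client like the curl call above presents no valid CAPTCHA token and is rejected.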

Testing and tuning :

Prepare for testing

  1. Enable web ACL logging, Amazon CloudWatch metrics, and web request sampling for the web ACL.
  2. Set your protections to Count mode.
  3. Associate the web ACL with a resource (a CLI sketch of these steps follows this list).
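
A sketch of steps 1 and 2 with the AWS CLI (step 3 was shown in Scenario A). The ARNs are placeholders, and the CloudWatch Logs group must already exist with a name that starts with aws-waf-logs-.

# Enable web ACL logging to a CloudWatch Logs group
aws wafv2 put-logging-configuration \
  --logging-configuration '{
    "ResourceArn": "arn:aws:wafv2:us-east-1:123456789012:regional/webacl/bot-protection-acl/EXAMPLE-ID",
    "LogDestinationConfigs": ["arn:aws:logs:us-east-1:123456789012:log-group:aws-waf-logs-bot-protection"]
  }'

# For Count mode, change the Bot Control rule's override in the rule JSON to
#   "OverrideAction": { "Count": {} }
# and push the change with: aws wafv2 update-web-acl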

Monitoring and tuning

  1. Monitor traffic and rule matches using logs, metrics, and sampled requests.
  2. Configure mitigations to address false positives.
  3. Correct rule inspection criteria.
  4. Correct more complex problems: add a mitigating rule, add a scope-down statement, add a label match rule, or change the version of the managed rule group (a label-match example follows this list).
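
One common tuning pattern: run Bot Control in Count mode so it only labels traffic, then add your own label-match rule afterwards to decide what actually gets blocked. A rough sketch follows; the label key, name, and priority are illustrative, so check your logs for the labels your traffic actually receives.

{
  "Name": "block-social-media-bots",
  "Priority": 5,
  "Statement": {
    "LabelMatchStatement": {
      "Scope": "LABEL",
      "Key": "awswaf:managed:aws:bot-control:bot:category:social_media"
    }
  },
  "Action": { "Block": {} },
  "VisibilityConfig": {
    "SampledRequestsEnabled": true,
    "CloudWatchMetricsEnabled": true,
    "MetricName": "block-social-media-bots"
  }
}

Because labels are only visible to rules evaluated after the rule that added them, this rule must sit below the Bot Control rule in the web ACL, which leads directly to the rule-order point below.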

A few things to keep in mind

  1. Rule order: Order AWS WAF rules so that labels are used properly. Fine-grained rules should typically be positioned high in the web ACL.
  2. Cost management: Use scope-down statements to manage Bot Control costs (an example follows this list). Use rate-based rules and CAPTCHA to limit abuse that drives up ATP (Account Takeover Prevention) or CAPTCHA costs.
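
For the cost point, a scope-down statement inside the ManagedRuleGroupStatement keeps Bot Control from inspecting traffic that doesn’t need it; for example, skipping static assets (the /static/ path is illustrative).

"ScopeDownStatement": {
  "NotStatement": {
    "Statement": {
      "ByteMatchStatement": {
        "SearchString": "/static/",
        "FieldToMatch": { "UriPath": {} },
        "TextTransformations": [ { "Priority": 0, "Type": "NONE" } ],
        "PositionalConstraint": "STARTS_WITH"
      }
    }
  }
}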

Reduce Bot Activities

  1. Identify applications with high potential as bot targets.
  2. Collect client-side signals.
  3. Design applications to remove incentives.
  4. Harden your SDKs.

Conclusion :

In the fast-paced digital landscape, where businesses thrive, the threat of bot attacks is real. From spreading misinformation to compromising data, bots are relentless. Yet, armed with knowledge and vigilance, businesses can stand strong.

Distinguish friend from foe, implement robust security, and stay alert. The battle against bots is ongoing, but with the right strategies, your digital fortress remains secure. As we journey towards a safer digital future, let resilience and proactive defence be your guiding lights.

🚀🛡️💻#DigitalSecurity #BotThreats #BotProtection🚀🛡️💻
