Infinity Crawler alternatives and similar packages
Based on the "Tools" category.
Alternatively, view Infinity Crawler alternatives based on common mentions on social networks and blogs.
-
ShareX
ShareX is a free and open source program that lets you capture or record any area of your screen and share it with a single press of a key. It also allows uploading images, text or other types of files to many supported destinations you can choose from. -
Another Redis Desktop Manager
๐๐๐A faster, better and more stable redis desktop manager [GUI client], compatible with Linux, Windows, Mac. What's more, it won't crash when loading massive keys. -
YARP
A toolkit for developing high-performance HTTP reverse proxy applications. -
Visual Studio Uninstaller
Uninstall and clean up all components of Visual Studio. -
OctoLinker
OctoLinker โ Links together, what belongs together -
NETworkManager
A powerful tool for managing networks and troubleshoot network problems! -
Open Live Writer
An open source fork of Windows Live Writer -
UnitsNet
Makes life working with units of measurement just a little bit better. -
Myrtille
A native HTML4 / HTML5 Remote Desktop Protocol and SSH client -
Netling
Netling is a load tester client for easy web testing. -
Downloader
Fast, cross-platform and reliable multipart downloader with asynchronous progress events for .NET applications. -
SmartCode
SmartCode = IDataSource -> IBuildTask -> IOutput => Build Everything!!! -
Fake JSON Server
Fake JSON Server is a Fake REST API that can be used as a Back End for prototyping or as a template for a CRUD Back End. -
Mockaco
๐ต HTTP mock server, useful to stub services and simulate dynamic API responses, leveraging ASP.NET Core features, built-in fake data generation and pure C# scripting -
Workflow Server
Workflow Server is a ready-to-use Workflow Engine-based application that you can deploy into your infrastructure. It can be integrated with NodeJS, PHP, Ruby, .NET, or Java applications via a REST API. Workflow Server is a key component for managing the lifecycle of business objects within your enterprise. -
FontAwesomeNet
Font-Awesome for .NET(Windows Forms and WPF). -
AnyStatus
A remote control for your CI/CD pipelines and more -
posh-dotnet
PowerShell tab completion and tooltip support for the dotnet CLI. -
JSON Formatter and Validator
A blazing fast JSON formatter and validator that won't share JSON with a server. -
CatLight
Build status notifications for TFS/Jenkins/Travis/Appveyor. Cross-platform desktop app based on .Net Core and Electron. [Free][Proprietary] -
Fiddler
The free web debugging proxy for any browser, system or platform
ONLYOFFICE Docs โ document collaboration in your environment
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.
Do you think we are missing an alternative of Infinity Crawler or a related project?
README
[Icon](images/icon.png)
Infinity Crawler
A simple but powerful web crawler library for .NET
Features
- Obeys robots.txt (crawl delay & allow/disallow)
- Obeys in-page robots rules (
X-Robots-Tag
header and<meta name="robots" />
tag) - Uses sitemap.xml to seed the initial crawl of the site
- Built around a parallel task
async
/await
system - Swappable request and content processors, allowing greater customisation
- Auto-throttling (see below)
Licensing and Support
Infinity Crawler is licensed under the MIT license. It is free to use in personal and commercial projects.
There are support plans available that cover all active Turner Software OSS projects. Support plans provide private email support, expert usage advice for our projects, priority bug fixes and more. These support plans help fund our OSS commitments to provide better software for everyone.
Polite Crawling
The crawler is built around fast but "polite" crawling of website. This is accomplished through a number of settings that allow adjustments of delays and throttles.
You can control:
- Number of simulatenous requests
- The delay between requests starting (Note: If a
crawl-delay
is defined for the User-agent, that will be the minimum) - Artificial "jitter" in request delays (requests seem less "robotic")
- Timeout for a request before throttling will apply for new requests
- Throttling request backoff: The amount of time added to the delay to throttle requests (this is cumulative)
- Minimum number of requests under the throttle timeout before the throttle is gradually removed
Other Settings
- Control the UserAgent used in the crawling process
- Set additional host aliases you want the crawling process to follow (for example, subdomains)
- The max number of retries for a specific URI
- The max number of redirects to follow
- The max number of pages to crawl
Example Usage
using InfinityCrawler;
var crawler = new Crawler();
var result = await crawler.Crawl(new Uri("http://example.org/"), new CrawlSettings {
UserAgent = "MyVeryOwnWebCrawler/1.0",
RequestProcessorOptions = new RequestProcessorOptions
{
MaxNumberOfSimultaneousRequests = 5
}
});
*Note that all licence references and agreements mentioned in the Infinity Crawler README section above
are relevant to that project's source code only.