Dan Lecocq
Verified Expert in Engineering
Software Developer
Dan is an engineer and cowboy coder with a background in big data and distributed systems. He has extensive experience with profiling, optimization, asynchronous network I/O, and getting huge amounts of work pushed through a pipeline reliably and efficiently.
Preferred Environment
JavaScript, Ruby, C++, Python, Git, Linux
The most amazing...
...thing I've coded is a system to crawl and index hundreds of millions of tweeted URLs within 10 minutes of being tweeted.
Work Experience
Senior Software Engineer
Moz
- Rewrote a service that recursively crawls customer sites, then analyzes and reports SEO issues.
- Wrote a queueing system (qless) that has been widely adopted internally and externally for production systems.
- Designed and implemented a service for crawling and indexing pages discovered through important RSS feeds.
- Helped to implement an algorithm to remove navigation, headers, and footers from web content for the purposes of indexing (eventually published).
- Wrote a number of web crawlers for different purposes, contributing many well-used open source projects to the state of the art of web crawling along the way.
- Crawled and processed tens of billions of pages across all my various crawlers.
- Worked to support our next generation of backlinks indexing infrastructure.
Graduate Researcher
IBM Research
- Worked on a collaboration between KAUST's supercomputing department and IBM Research.
- Augmented a computational steering library to work with WebSockets.
- Included work with Lawrence Berkeley National Lab to eventually support streaming visualization.
- Targeted KAUST's supercomputing infrastructure, an IBM BlueGene/P.
- Worked to enable researchers to examine, monitor, and update parameters of running simulations.
Experience
Shovel
https://github.com/seomoz/shovel
qless
https://github.com/seomoz/qless
simhash-py
https://github.com/seomoz/simhash-py
pyreBloom
https://github.com/seomoz/pyreBloom
dragnet
https://github.com/seomoz/dragnet
Skills
Languages
Python, JavaScript, C++, Java, Lua, Ruby
Paradigms
Distributed Programming, Concurrent Programming, Test-driven Development (TDD)
Platforms
Linux
Storage
Amazon S3 (AWS S3), Redis, Elasticsearch, HBase, NoSQL, MySQL
Other
Open Source, WebSockets
Libraries/APIs
Gevent, jQuery
Tools
Git
Education
Master's Degree in Applied Mathematics and Computational Science
King Abdullah University of Science and Technology - Thuwal, Saudi Arabia
Bachelor's Degree in Computer Science
Colorado School of Mines - Golden, CO