Update Weekly Blog Posts: 2021 authored by hackhard
## Jun 14:
The current logic is this:
[![](https://mermaid.ink/img/eyJjb2RlIjoiJSUgRW5hYmxlIEpTIHRvIHNlZSB0aGlzIFxuJSUgVXNlIG9mIGRhc2hlZCBsaW5lcyBhbmQgYm94ZXMgc2hvdyB0aGUgdGhpbmdzIHRoYXQgaGF2ZW4ndCBiZWVuIGltcGxlbWVudGVkIGFzIG9mIG5vdy4gXG5ncmFwaCBURFxuc3ViZ3JhcGggRmV0Y2hcbkFbRmV0Y2ggdXJsIHVzaW5nIFRvciBjbGllbnRdXG5CW0ZldGNoIHVybCB1c2luZyBOb24tVG9yIGNsaWVudF1cbmVuZFxuc3ViZ3JhcGggRE9NIEFuYWx5c2lzXG5FO0UxO0UyO0c7RzE7RzI7SDtIMjtYXG5lbmRcbkFbRmV0Y2ggdXJsIHVzaW5nIFRvciBjbGllbnRdIC0tPiBDe0lzIHRoZSA8YnI-IHN0YXR1cyBjb2RlIDxicj4gc2FtZT99LS1ZRVM6IDxicj4gTm8gcmVkaXJlY3Rpb24sIERPTSBjaGVja3MgcmVxdWlyZWQtLT5EMVtSZW1vdmUgR0RQUiBwb3B1cHNdXG5CW0ZldGNoIHVybCB1c2luZyBOb24tVG9yIGNsaWVudF0gLS0-Qy0tTk8tLS0-RHtjaGVjayB3aGV0aGVyIDxicj4gdG9yIHJldHVybnMgNHh4IG9yIDV4eCA8YnI-ZXJyb3IgY29kZXN9LS1ZRVMtLS0tLS0tLS0tPkYoVG9yIEJsb2NrIEVycm9yKVxuRC0tTk86IDxicj4gQ291bGQgYmUgR0RQUiwgcmVkaXJlY3Rpb24sIENhcHRjaGEtLT5FNC0uTm8tLT5EMS0tPkVbL0FkZGl0aW9uYWwgVGVzdHMgPGJyPiBET00gY2hlY2tzL11cbkUtLT5FMVtET00gQ2hlY2tzIDxicj4gUGVyY2VudGFnZSBvZiBkaWZmZXJuY2UgaW4gRE9NIG5vZGVzXVxuRS0uLT5FMltDb25zZW5zdXMgTW9kdWxlXVxuRS0tLS0tLT5FM1tDYXB0Y2hhIENoZWNrXSAlJSBUaGlzIHdpbGwgdXNlIHRoZSBmYWN0IG9mIHRoZSByZXF1ZXN0IHBhdGggY29udGFpbmluZyBjYXB0Y2hhIGluIHRoZSB1cmwgaXRzZWxmXG5FNHtJZiB0aGUgPGJyPiBSZWRpcmVjdGVkIHdlYnNpdGUgPGJyPnJldHVybnMgZXJyb3J9LS15ZXMuLT5GXG5FMS0tPkd7aWYgPGJyPiBzY29yZSA-IDB9XG5HLS1OTywgaWYgc2NvcmUgPSAwLS0tPlgoTWF0Y2hlZCE8YnI-bm8gZXJyb3JzKVxuRy0tWUVTLS0-RzF7aWYgPGJyPiBzY29yZSA-IEslfS0tWUVTLS0tLT5IMihUb3IgcmV0dXJucyBFcnJvciA8YnI-IG9yIGluIHNvbWUgY2FzZXMsIGRlbm90ZXMgYSBkaWZmZXJlbnQgcGFnZSkgJSUgRm9yIG1vc3QgY2FzZXMgaXQgcmV0dXJucyBlcnJvciBvciBpdCBtaWdodCBiZSBwb3NzaWJsZSB0aGF0IHRoZSBwYWdlIGhhc24ndCBiZWVuIGxvYWRlZC5cbkcxLS1OTy0tPkcyKEZpbHRlciBsaXN0KVxuRzEtLk5vLS4uLT5FMlxuRy0tTk8tLS0tLT5IW0Rlbm90ZXMgUG9wLXVwcywgPGJyPm9yIGluIHNvbWUgY2FzZXMgPGJyPndoZW4gdGhlIGRpZmZlcmVuY2UgaXMgbG90IGluIG5lZ2F0aXZlIHRlcm1zLSB0YmIgPiBuYmI8YnI-ZGVub3RlcyBhbm90aGVyIHBhZ2UsIHdoaWNoIG1pZ2h0IGhhdmUgbW9yZSBET00gbm9kZXMuXVxuQS0tLS0tLS0tLS0tLS0-UShXZWJzaXRlcyB3aXRob3V0IGVycm9yLCBidXQgZGlmZmVyZW50IHB
hZ2VzKVxuXG5cblxuY2xpY2sgUSBocmVmIFwiaHR0cDovL3d3dy5kb21pbm9zLmNvbVwiIF9ibGFua1xuJSUgc3R5bGUgRDEgc3Ryb2tlLXdpZHRoOjFweCxzdHJva2UtZGFzaGFycmF5OiA1IDhcbiUlIHN0eWxlIEU0IHN0cm9rZS13aWR0aDoxcHgsc3Ryb2tlLWRhc2hhcnJheTogNSA4XG4lJSBzdHlsZSBFMyBzdHJva2Utd2lkdGg6MXB4LHN0cm9rZS1kYXNoYXJyYXk6IDUgOFxuc3R5bGUgRTIgc3Ryb2tlLXdpZHRoOjFweCxzdHJva2UtZGFzaGFycmF5OiA1IDhcbnN0eWxlIEYgc3Ryb2tlLXdpZHRoOjNweCxmaWxsOiNmMDQiLCJtZXJtYWlkIjp7InRoZW1lIjoibmV1dHJhbCJ9LCJ1cGRhdGVFZGl0b3IiOmZhbHNlLCJhdXRvU3luYyI6dHJ1ZSwidXBkYXRlRGlhZ3JhbSI6ZmFsc2V9)](https://mermaid-js.github.io/mermaid-live-editor/edit##eyJjb2RlIjoiJSUgRW5hYmxlIEpTIHRvIHNlZSB0aGlzIFxuJSUgVXNlIG9mIGRhc2hlZCBsaW5lcyBhbmQgYm94ZXMgc2hvdyB0aGUgdGhpbmdzIHRoYXQgaGF2ZW4ndCBiZWVuIGltcGxlbWVudGVkIGFzIG9mIG5vdy4gXG5ncmFwaCBURFxuc3ViZ3JhcGggRmV0Y2hcbkFbRmV0Y2ggdXJsIHVzaW5nIFRvciBjbGllbnRdXG5CW0ZldGNoIHVybCB1c2luZyBOb24tVG9yIGNsaWVudF1cbmVuZFxuc3ViZ3JhcGggRE9NIEFuYWx5c2lzXG5FO0UxO0UyO0c7RzE7RzI7SDtIMjtYXG5lbmRcbkFbRmV0Y2ggdXJsIHVzaW5nIFRvciBjbGllbnRdIC0tPiBDe0lzIHRoZSA8YnI-IHN0YXR1cyBjb2RlIDxicj4gc2FtZT99LS1ZRVM6IDxicj4gTm8gcmVkaXJlY3Rpb24sIERPTSBjaGVja3MgcmVxdWlyZWQtLT5EMVtSZW1vdmUgR0RQUiBwb3B1cHNdXG5CW0ZldGNoIHVybCB1c2luZyBOb24tVG9yIGNsaWVudF0gLS0-Qy0tTk8tLS0-RHtjaGVjayB3aGV0aGVyIDxicj4gdG9yIHJldHVybnMgNHh4IG9yIDV4eCA8YnI-ZXJyb3IgY29kZXN9LS1ZRVMtLS0tLS0tLS0tPkYoVG9yIEJsb2NrIEVycm9yKVxuRC0tTk86IDxicj4gQ291bGQgYmUgR0RQUiwgcmVkaXJlY3Rpb24sIENhcHRjaGEtLT5FNC0uTm8tLT5EMS0tPkVbL0FkZGl0aW9uYWwgVGVzdHMgPGJyPiBET00gY2hlY2tzL11cbkUtLT5FMVtET00gQ2hlY2tzIDxicj4gUGVyY2VudGFnZSBvZiBkaWZmZXJuY2UgaW4gRE9NIG5vZGVzXVxuRS0uLT5FMltDb25zZW5zdXMgTW9kdWxlXVxuRS0tLS0tLT5FM1tDYXB0Y2hhIENoZWNrXSAlJSBUaGlzIHdpbGwgdXNlIHRoZSBmYWN0IG9mIHRoZSByZXF1ZXN0IHBhdGggY29udGFpbmluZyBjYXB0Y2hhIGluIHRoZSB1cmwgaXRzZWxmXG5FNHtJZiB0aGUgPGJyPiBSZWRpcmVjdGVkIHdlYnNpdGUgPGJyPnJldHVybnMgZXJyb3J9LS55ZXMuLT5GXG5FMS0tPkd7aWYgPGJyPiBzY29yZSA-IDB9XG5HLS1OTywgaWYgc2NvcmUgPSAwLS0tPlgoTWF0Y2hlZCE8YnI-bm8gZXJyb3JzKVxuRy0tWUVTLS0-RzF7aWYgPGJyPiBzY29yZSA-IEslfS0tWUVTLS0tLT5IMihUb3IgcmV0dXJucyBFcnJvc
iA8YnI-IG9yIGluIHNvbWUgY2FzZXMsIGRlbm90ZXMgYSBkaWZmZXJlbnQgcGFnZSkgJSUgRm9yIG1vc3QgY2FzZXMgaXQgcmV0dXJucyBlcnJvciBvciBpdCBtaWdodCBiZSBwb3NzaWJsZSB0aGF0IHRoZSBwYWdlIGhhc24ndCBiZWVuIGxvYWRlZC5cbkcxLS1OTy0tPkcyKEZpbHRlciBsaXN0KVxuRzEtLk5vLS4uLT5FMlxuRy0tTk8tLS0tLT5IW0Rlbm90ZXMgUG9wLXVwcywgPGJyPm9yIGluIHNvbWUgY2FzZXMgPGJyPndoZW4gdGhlIGRpZmZlcmVuY2UgaXMgbG90IGluIG5lZ2F0aXZlIHRlcm1zLSB0YmIgPiBuYmI8YnI-ZGVub3RlcyBhbm90aGVyIHBhZ2UsIHdoaWNoIG1pZ2h0IGhhdmUgbW9yZSBET00gbm9kZXMuXVxuQS0tLS0tLS0tLS0tLS0-UShXZWJzaXRlcyB3aXRob3V0IGVycm9yLCBidXQgZGlmZmVyZW50IHBhZ2VzKVxuXG5cblxuY2xpY2sgUSBocmVmIFwiaHR0cDovL3d3dy5kb21pbm9zLmNvbVwiIF9ibGFua1xuJSUgc3R5bGUgRDEgc3Ryb2tlLXdpZHRoOjFweCxzdHJva2UtZGFzaGFycmF5OiA1IDhcbiUlIHN0eWxlIEU0IHN0cm9rZS13aWR0aDoxcHgsc3Ryb2tlLWRhc2hhcnJheTogNSA4XG4lJSBzdHlsZSBFMyBzdHJva2Utd2lkdGg6MXB4LHN0cm9rZS1kYXNoYXJyYXk6IDUgOFxuc3R5bGUgRTIgc3Ryb2tlLXdpZHRoOjFweCxzdHJva2UtZGFzaGFycmF5OiA1IDhcbnN0eWxlIEYgc3Ryb2tlLXdpZHRoOjNweCxmaWxsOiNmMDQiLCJtZXJtYWlkIjoie1xuICBcInRoZW1lXCI6IFwibmV1dHJhbFwiXG59IiwidXBkYXRlRWRpdG9yIjpmYWxzZSwiYXV0b1N5bmMiOnRydWUsInVwZGF0ZURpYWdyYW0iOmZhbHNlfQ)
+ Dashed lines and boxes mark the parts that haven't been implemented yet.
+ K is currently set to 150 (from experimental analysis).
+ The Captcha Checking Module was proposed recently. It looks for "captcha" in the request paths of the responses we get while loading a website.
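The captcha check from the last point could be sketched roughly as below. This is only an illustration, not the module itself: `has_captcha_request` and the sample URLs are hypothetical, and it assumes the list of request URLs recorded during the page load is already available.

```python
from urllib.parse import urlsplit


def has_captcha_request(request_urls):
    """Return True if any recorded request URL has "captcha" in its path."""
    for url in request_urls:
        # Only the path is inspected (case-insensitively); host and query are ignored.
        if "captcha" in urlsplit(url).path.lower():
            return True
    return False


# Hypothetical request URLs, as they might be recorded during a page load.
blocked = ["https://example.com/", "https://example.com/cdn/captcha/challenge.js"]
clean = ["https://example.com/", "https://example.com/static/app.js"]
print(has_captcha_request(blocked), has_captcha_request(clean))
```

Matching on the path alone is a deliberate simplification here; whether the host or query string should count too is still open.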
For more details, you could look into the [Experimental code](https://raw.githubusercontent.com/Hackhard/Fetcher/main/status%20code/test_run4/tr.py) and its [output](https://raw.githubusercontent.com/Hackhard/Fetcher/main/status%20code/test_run4/tr_bash_output) to gain more insight.
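As I read the flowchart, its two main decisions (the status code comparison and the DOM score thresholds) come down to something like the sketch below. The function names and the returned labels are made up for illustration, the score is taken as an input because its exact formula isn't covered here, and the GDPR and redirect steps in between are left out.

```python
def status_branch(tor_status, non_tor_status):
    """First decision in the flowchart: compare the two status codes."""
    if tor_status == non_tor_status:
        return "dom_checks"  # no redirection, DOM checks required
    if 400 <= tor_status < 600:
        return "tor_block_error"
    return "maybe_gdpr_redirect_or_captcha"


def score_branch(score, k=150):
    """DOM-analysis decision; score is the percentage difference in DOM nodes."""
    if score == 0:
        return "matched"
    if score > k:
        return "tor_error_or_different_page"
    if score > 0:
        return "filter_list"
    return "popups_or_other_page"  # negative difference


print(status_branch(403, 200), score_branch(40))
```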
## Insights:
At present I'm checking the reliability of the modules. For example:
```Cloudflare blocks the requests library, so requests isn't suited here. I read that this is because the request's User-Agent header carries Python's name, which gets it marked as a bot. I changed the User-Agent, but the result was the same, so that isn't much use in this case. For specific cases like mastercard, where the status is 3xx (reload), it returns results easily.```
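For reference, overriding the User-Agent with requests looks roughly like this. It is a sketch rather than the code from the experiment: `build_headers`, `fetch_status`, and the browser string are assumptions for illustration, and as noted above, changing the header alone did not get past Cloudflare.

```python
def build_headers(user_agent):
    """Headers that replace the default python-requests User-Agent."""
    return {"User-Agent": user_agent}


def fetch_status(url, user_agent, timeout=10):
    """Fetch url with a custom User-Agent and return the status code."""
    import requests  # third-party; imported here so build_headers has no dependency

    response = requests.get(url, headers=build_headers(user_agent), timeout=timeout)
    return response.status_code


# Illustrative browser-like string, not the one used in the experiment.
BROWSER_UA = "Mozilla/5.0 (X11; Linux x86_64; rv:89.0) Gecko/20100101 Firefox/89.0"
print(build_headers(BROWSER_UA))
```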
Also, I've been asked not to use selenium-wire, so I'll be changing that soon.
![image](https://user-images.githubusercontent.com/34208125/121801062-0a2ceb80-cc53-11eb-9933-19df70791265.png)
So I also plan to write a checker method that would check the following:
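```Python
def check(request_blocked, har_first, via_tor=False):
    """Decide which status source to trust for one fetch.

    request_blocked: True if the requests module was blocked.
    har_first: HAR.first(), the status code of the first request in the HAR.
    via_tor: True for the Tor fetch, False for the non-Tor fetch.
    """
    if not request_blocked:
        # Not covered by the plan; assumed here that requests is trusted.
        return "request"
    # Just to be cautious, check the HAR.
    # Tor also treats 3xx as a failure; non-Tor only 4xx and 5xx.
    lowest_bad = 300 if via_tor else 400
    if lowest_bad <= har_first < 600:
        return "request"
    if har_first == 0:
        if via_tor:
            return "check for captcha and warnings"
        return "no case found till now"
    return "har"  # go with HAR.first()
```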
I hope this will make the code a bit better in terms of reliability. This needs discussion, since it's only my own idea for now.
Also, HAR.first() means the status code of the first request sent to the server, which is generally the index page.
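A minimal sketch of what HAR.first() could look like, assuming the HAR has already been loaded into a dict with the standard `log.entries[].response.status` layout. The name `har_first_status` and the sample data are hypothetical.

```python
def har_first_status(har):
    """Return the status code of the first entry in a HAR dict, or None if empty."""
    entries = har.get("log", {}).get("entries", [])
    if not entries:
        return None
    return entries[0]["response"]["status"]


# Minimal hand-written HAR-like dict; real HAR files carry many more fields.
sample = {"log": {"entries": [
    {"request": {"url": "https://example.com/"}, "response": {"status": 301}},
    {"request": {"url": "https://www.example.com/"}, "response": {"status": 200}},
]}}
print(har_first_status(sample))
```

This takes the entries in file order. If the tool that writes the HAR doesn't guarantee that order, sorting by `startedDateTime` first might be needed.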