And some more:
Content-Type: image / png
Content-Type: image/$JPG
Content-Type: image%2Fjpeg
Content-Type: images/gif
Content-Type: max-age=1555200
Content-Type: plain/txt
Content-Type: test/plain
Content-Type: text/htmml
Content-Type: text/javasciprt
Content-Type: text/javascriipt
Content-Type: text\html
Content-Type: type
Content-Type: TYPE/SUBTYPE
Content-Type: UTC
Content-Type: UTF-8
Content-Type: width="1280" height="720"
Recently I've been building reports based upon HTTP Archive data. Rather than call BigQuery, I instead export the data I'm interested in into Parquet format and then query it locally on my laptop using DuckDB. Here's how I did it: https://discuss.httparchive.org/t/querying-the-http-archive-with-duckdb/2568
#httpArchive #parquet #DuckDB
And my favourite is:
Last-Modified: Invalid Date
... which was seen on 119 responses from the #httpArchive 2023-04-01 mobile run.
#http
And my favourite is:
Last-Modified: Invalid Date
... which was seen on 119 responses from the #httpArchive 2023-04-01 mobile run.
#http
Is this common? I queried the HTTP Archive and it found 121,058 double weak resources in the 2022-12-01 dataset with ETags that start with W/W/. That's 0.008% of all the resources. Good news: I found no triple weak validators.
#httpArchive