13 November 2021

Algorithms and Data Structures for Massive Datasets

Technical Reviewer:

Dzejla Medjedovic, Emin Tahirovic, and Ines Dedovic
MEAP began July 2020 Publication in January 2022 (estimated)
ISBN 9781617298035 325 pages (estimated) printed in black & white

https://www.manning.com/books/algorithms-and-data-structures-for-massive-datasets



16 September 2021

netcat - nc

netcat
ncat
nc


-n : skip DNS lookups
-u : Use of UDP mode (instead of TCP)
-v : Extensive output
-w : timeout (seconds)
-z : Port scanner mode (zero I/O mode); only listening services are scanned (no data is sent)

Scan port 123 for NTP:
nc -z -v -u 0.us.pool.ntp.org 123


scan ports:
nc -w 2 -z 192.168.10.1 1-1024

nc -v -n 8.8.8.8 1-1000




30 July 2021

22 July 2021

AWS Serverless Data Lake Framework

SDLF is a peculiar framework developed by AWS as a Data Lake using their services.

It is interesting how it creates two pipelines where the data is parsed (reduced), process as a big ETL, where finally can be exploited by BI products.

  • AWS Glue
  • S3
  • Lambda


https://sdlf.readthedocs.io/en/latest/index.html

https://sdlf.workshop.aws/en/

https://github.com/awslabs/aws-serverless-data-lake-framework



Blog Archive

Disclaimer

Qux