What is the best way to sort a terabyte-sized array of data when you have limited RAM (500k) and each array element holds a couple of data items of about 1-10k each?


External merge sort. First read the data chunk by chunk, sort each chunk in memory (with quicksort, for example), and write it back to disk as a sorted chunk. After this, merge the sorted chunks the way merge sort does. This is how most databases sort data that does not fit in memory.
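The two phases described above can be sketched roughly as follows. This is only a minimal illustration, assuming the records are newline-terminated text lines compared as strings. The names `external_sort`, `_write_run`, and `max_chunk_bytes` are made up for this example, and Python's built-in `list.sort` (Timsort) stands in for the quicksort mentioned above.

```python
import heapq
import os
import tempfile


def _write_run(sorted_lines, tmp_dir):
    """Write one sorted chunk (a "run") to its own temp file; return its path."""
    fd, path = tempfile.mkstemp(dir=tmp_dir, suffix=".run")
    with os.fdopen(fd, "w") as f:
        f.writelines(sorted_lines)
    return path


def external_sort(in_path, out_path, max_chunk_bytes, tmp_dir=None):
    """Sort the lines of in_path into out_path, buffering roughly
    max_chunk_bytes of records in memory at a time.
    Returns the number of sorted runs that were written and merged."""
    run_paths = []

    # Phase 1: read a chunk that fits in memory, sort it, write it back.
    with open(in_path) as src:
        chunk, size = [], 0
        for line in src:
            if not line.endswith("\n"):
                line += "\n"  # normalize a final line without a newline
            chunk.append(line)
            size += len(line)  # character count as a rough size estimate
            if size >= max_chunk_bytes:
                chunk.sort()
                run_paths.append(_write_run(chunk, tmp_dir))
                chunk, size = [], 0
        if chunk:
            chunk.sort()
            run_paths.append(_write_run(chunk, tmp_dir))

    # Phase 2: k-way merge of the sorted runs. heapq.merge is lazy, so it
    # pulls one line at a time from each run instead of loading them whole.
    runs = [open(p) for p in run_paths]
    try:
        with open(out_path, "w") as dst:
            dst.writelines(heapq.merge(*runs))
    finally:
        for f in runs:
            f.close()
        for p in run_paths:
            os.remove(p)
    return len(run_paths)
```

With the numbers in the question (500k of RAM, 1-10k per element), each run would hold only a few dozen to a few hundred elements, so a terabyte would produce far more runs than can be opened at once. A real solution would probably merge in several passes, combining a limited number of runs per pass, rather than in the single merge shown here.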

How is this relevant to a product manager role? This is an engineering problem.

External merge sort is the answer

Shoot me if anyone ever asks me this in an interview.

Hi, can you please answer the question?

You had to know sorting, memory management, Big-O calculations, and data structures.

