Re: add an auto_increment column

capitnfrakass Mon, 07 Feb 2022 18:11:36 -0800

Hello,

For this query:

df.select("*").orderBy("amount",ascending=False).show()

+------+------+
| fruit|amount|
+------+------+
|tomato|     9|
| apple|     6|
|cherry|     5|
|orange|     3|
+------+------+

I want to add a column "top", in which the value is: 1,2,3... meaningtop1, top2, top3...


How can I do it?

Thanks.




On 07/02/2022 21:18, Gourav Sengupta wrote:

Hi,

can we understand the requirement first?

What is that you are trying to achieve by auto increment id? Do you
just want different ID's for rows, or you may want to keep track of
the record count of a table as well, or do you want to do use them for
surrogate keys?

If you are going to insert records multiple times in a table, and
still have different values?

I think without knowing the requirements all the above responses, like
everything else where solutions are reached before understanding the
problem, has high chances of being wrong.

Regards,
Gourav Sengupta

On Mon, Feb 7, 2022 at 2:21 AM Siva Samraj <samraj.mi...@gmail.com>
wrote:

Monotonically_increasing_id() will give the same functionality

On Mon, 7 Feb, 2022, 6:57 am , <capitnfrak...@free.fr> wrote:

For a dataframe object, how to add a column who is auto_increment
like
mysql's behavior?

Thank you.

---------------------------------------------------------------------

To unsubscribe e-mail: user-unsubscr...@spark.apache.org


---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

Re: add an auto_increment column

Reply via email to