写点什么

Tensorflow 小技巧 (一)

用户头像
毛显新
关注
发布于: 46 分钟前

how-do-i-select-rows-from-a-dataframe-based-on-column-values

To select rows whose column value equals a scalar, some_value, use ==:

df.loc[df['column_name'] == some_value]
复制代码

To select rows whose column value is in an iterable, some_values, use isin:

df.loc[df['column_name'].isin(some_values)]
复制代码

how-do-i-sort-a-dictionary-by-value

x = {1: 2, 3: 4, 4: 3, 2: 1, 0: 0}dict(sorted(x.items(), key=lambda item: item[1]))
复制代码

how-can-i-count-the-occurrences-of-a-list-item

from collections import Counterl = ["a","b","b"]Counter(l)
复制代码

pandas.DataFrame.drop_duplicates

df = pd.DataFrame({...     'brand': ['Yum Yum', 'Yum Yum', 'Indomie', 'Indomie', 'Indomie'],...     'style': ['cup', 'cup', 'cup', 'pack', 'pack'],...     'rating': [4, 4, 3.5, 15, 5]... })df.drop_duplicates(subset=['brand'])
复制代码

tf.data.Dataset-----as_numpy_iterator()

Returns an iterator which converts all elements of the dataset to numpy.

dataset = tf.data.Dataset.from_tensor_slices([1, 2, 3])for element in dataset.as_numpy_iterator():  print(element)
复制代码

tf.data.Dataset

The tf.data.Dataset API supports writing descriptive and efficient input pipelines. Dataset usage follows a common pattern:

  1. Create a source dataset from your input data.

  2. Apply dataset transformations to preprocess the data.

  3. Iterate over the dataset and process the elements.

Iteration happens in a streaming fashion, so the full dataset does not need to fit into memory.

The simplest way to create a dataset is to create it from a python list:

dataset = tf.data.Dataset.from_tensor_slices([1, 2, 3])for element in dataset:  print(element)
复制代码

Once you have a dataset, you can apply transformations to prepare the data for your model:

dataset = tf.data.Dataset.from_tensor_slices([1, 2, 3])dataset = dataset.map(lambda x: x*2)list(dataset.as_numpy_iterator())
复制代码


用户头像

毛显新

关注

还未添加个人签名 2021.07.26 加入

还未添加个人简介

评论

发布
暂无评论
Tensorflow小技巧(一)